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Smallpox should be saved 


Secure virus stocks in the United States and Russia may still prove useful and should not be 
destroyed. A political compromise is the best way to make that happen. 


history. The variola virus that causes it last swept through 

humans in a natural outbreak in Somalia in 1977, and the world 
was declared free of smallpox in 1980. The disease that killed Queen 
Mary IJ of England, Tsar Peter II of Russia, King Louis XV of France 
and hundreds of millions more during the past century alone is gone 
— but not forgotten. Smallpox lives on in the memories of those who 
witnessed its awful impact first hand. It is a terrifying spectre for those 
who warn that terrorists may seek to spread disease. And it survives in 
two laboratories, where research continues on live virus. 

The fate of these known virus stocks, held in secure laboratories in 
the United States and Russia, is once again the subject of debate. In 
May 2011, the World Health Assembly — the decision-making body 
of the World Health Organization — will vote on whether to set a date 
by which these collections should be destroyed. They should not be 
eliminated, at least not completely. 

This journal has previously argued that, because the possibility 
cannot be ruled out that other, secret stocks of smallpox are being 
held elsewhere, the benefits of continued access to live virus stocks 
for research outweigh the risks of maintaining them. In 1999 (Nature 
398, 733; 1999), we wrote: “Rightly, public-health advocates bemoan 
the prospect of any measure that increases the risk of a re-emergence 
of this scourge. But, given the impossibility of knowing who now pos- 
sesses the virus, and from where it might appear, it is better to have a 
number of arrows in the quiver than to destroy the stock and cross our 
collective fingers.’ The world has changed much since then, and the 
US terrorist attacks of September 2001 and the subsequent focus on 
prospects for bioterrorism have increased the stakes further. Small- 
pox would be an effective weapon — it spreads easily and kills almost 
one-third of the people it infects. Furthermore, the triumph of small- 
pox eradication after widespread vaccination in the 1960s and 1970s 
means that some 40% of the world’s population has no immunity. 

In an essay published online earlier this month in the journal Bio- 
security and Bioterrorism: Biodefense Strategy, Practice, and Science, 
security expert Jonathan Tucker argues that many of the stated goals of 
the World Health Organization's smallpox research programme have 
been achieved — such as antiviral drugs and diagnostic tools — and 
that there is a diminished need for live virus to be retained (J. B. Tucker 
Biosecur. Bioterror. doi:10.1089/bsp.2010.0065; 2011). This may be 
true, but further study of the virus could still reveal a huge amount, 
both on the specifics of what makes it such a formidable foe and on 
human immunology and viral pathogenesis in general. 

The scientific case for retaining live variola virus to improve public 
health is strong. The risk of doing so is largely political. The threat 
of an accidental release or theft from the biosecure repositories at 
which it is held — at the Centers for Disease Control and Preven- 
tion in Atlanta, Georgia, and the State Research Center of Virology 
and Biotechnology (‘Vector’) near Novosibirsk — seems remote. 


F: much of the world’s population, smallpox is a disease of 


Destruction of the stocks would be a largely symbolic step. 
Countries in Africa and Asia endured the havoc of smallpox most 
recently, and fear ofa repeat outbreak — as well as mistrust of those who 
control the stocks — has produced strong calls from these nations for its 
destruction. There is equally strong insistence in countries such as the 
United States and Russia that it should be retained for defensive research. 
Ina recent article in The New York Times, former US government 
heavy-hitters Kenneth Bernard and Richard 


“Smallpox is Danzig call elimination of the virus stocks a 
a disease of “bad idea”. They say: “We have an obligation to 
history, but ensure that our children’s children never have 
it cannot be to worry about seeing this ancient scourge kill 
consigned to yet again. Destroying the remaining research 
the past. ” specimens of smallpox at this time will prevent 


us from fulfilling this obligation” 

Tucker warns that the result of this political stand-off, further compli- 
cated by the rare involvement of security and defence officials in a public- 
health matter, could be a “diplomatic train wreck” — particularly given 
that the World Health Assembly will seek a consensus decision. As part 
of acompromise solution, he suggests that the United States and Russia 
should agree to destroy part of their collections, such as hybrid viruses 
that are scientifically redundant. A handful of strains would continue 
to be held at each site, for research in the short term and reference after- 
wards. That seems a sensible approach. True, the world will be denied a 
final victory in the heroic effort to defeat smallpox, but uncertainty over 
undeclared surviving stocks would make that a hollow accomplishment 
anyway. So, too, would development of molecular-biology techniques 
that could see the smallpox virus reconstructed. Smallpox is a disease 
of history, but it cannot be consigned to the past. m 


Goodbye Tevatron 


The decision to shut down the ageing particle 
collider at Fermilab is the right one. 


(): 10 January, the US Department of Energy (DOE) 


announced that the United States will close its only particle 

collider, the Tevatron, located at Fermilab in Batavia, Illinois. 
The decision to shutter the decades-old collider this year (see Nature 
469, 141; 2011) goes against the recommendations of the US parti- 
cle-physics community’s scientific advisory group, which believes 
that the machine still has work to do. But given the financial con- 
straints facing Fermilab, and its ambitions for the future, the choice 
is at once courageous and correct. Moreover, it should serve as an 
example to other scientific communities that are clinging to ageing 
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infrastructure at the expense of the next generation. 

As is often the case in high-energy physics, the success of the Teva- 
tron can be measured in orders of magnitude. Since the Tevatron’s 
opening in 1985, its luminosity, a measure of the number of collisions 
produced, has increased by a factor of 20 million. Its detectors have 
gathered data at an exponential rate, along the way discovering the top 
quark in 1995 and the tau neutrino in 2000. 

But few discoveries have emerged from the accelerator in the past 
decade. The search for heavier particles, such as the Higgs boson, a 
critical part of the mechanism for mass, uses higher luminosities and 
energies than the Tevatron could ever produce. To make up for its lack 
of power, the machine has been forced into a war of attrition: running 
for years beyond its expected lifetime in the hope of collecting enough 
data to spot something new. It is an inefficient way to do science. 

Despite its mounting obsolescence, scientists have continued to 
push to extend the Tevatron’s run. In the beginning, the case was made 
that it could yet beat its European rival, the Large Hadron Collider 
(LHC) at CERN, Europe's particle-physics lab near Geneva, Switzer- 
land, to the discovery of the Higgs. That argument held some weight 
while the European collider was facing early teething pains, but last 
year the LHC roared to life and is now churning out data at a stag- 
gering rate (see page 282). The latest scientific case for keeping the 
Tevatron running until the end of 2014 revolves around its ability only 
to plug gaps in the LHC’s work. 

Although scientifically persuasive, this argument must be measured 
against the US$35 million needed each year to operate the machine. 
That sum might seem small in comparison to the US research budget 
asa whole, but it represents roughly 10% of Fermilab’s annual operat- 
ing funds. Lab leaders would like to put the money elsewhere: faced 
with the truth that the Tevatron cannot run forever, they are hoping 
to develop a strong programme in neutrino physics. The new direc- 
tion will keep the scientists at Fermilab working for years to come and 
could even lead to future collider projects. 


To administrators at Fermilab and the DOE, the decision was clear: 
the lab must shut down the Tevatron in order to move forward. In one 
sense it isa risky strategy — to trade a steady income for the promise of 
better to come. Funds for the new programme are far from guaranteed 
and could yet be a tall order in the current economic climate. But in 
making the sacrifice, the lab showed that it is serious about its new 

neutrino programme. It is also providing its 


“It can serve political supporters with evidence that it is 
as an example ene as a responsible research body, some- 
to scientific thing that may help them win the additional 
communities money needed for the new programme. 

that are clinging The move stands in stark contrast to other 
to ageing fields that have chosen to protect the old 
° over the new. Some in the astronomy com- 
infrastructure munity have lobbied hard to keep ‘historic’ 
at the expense facilities such as the Arecibo Observatory 
of the next in the United States and Jodrell Bank in 


generation. z the United Kingdom. Elsewhere in Europe, 


geriatric research reactors and synchro- 
tron light sources are kept running more out of national pride than 
scientific need. 

The case for closing many of these facilities is less clear-cut than 
the Tevatron, however. The collider has obviously been superseded 
by its European rival, and high-energy physicists have little choice 
but to move on. By contrast, telescopes and synchrotrons can always 
gather more data on another star or a new material. Yet at some stage, 
the scientific value of these facilities becomes so diminished that it 
must be weighed against the new opportunities the funding could give 
to the community. Rather than fighting protracted political battles 
against their inevitable closure, researchers should provide new ideas 
and facilities that could be leveraged using funds from the closure of 
the last generation. Sucha strategy is more risky, but it will ultimately 
prove much more rewarding. = 


Natural wealth 


Ecological models can be used to guide 
economic policy — but should they? 


there exists an equal and opposite economist. In that spirit, 
Nature this week presents related articles that take contradic- 
tory stances on what the banking system could, or indeed should, 
learn from ecologists. In a Perspective (see page 351), Andrew 
Haldane, executive director of financial stability at the Bank of 
England in London, and Robert May, a theoretical ecologist at the 
University of Oxford, UK, and former chief scientific adviser to the 
UK government, argue that the stability of complex networks of linked 
dependencies that make up the world’s financial system can be tested 
using a simplified ecological model. They say that the results of their 
approach highlight ways in which economic policy could be changed 
to make the system more secure. Regulation of elements such as liquid- 
ity ratios, banking structures and trade in complex derivatives, they 
say, should work to protect the system rather than individual banks. 
In a News & Views Forum (see page 302), this approach is criticized 
as potentially dangerous by Neil Johnson, a physicist at the University 
of Miami in Florida who studies real-world complex systems. Johnson 
says that policy implications drawn from sucha simplistic model will be 
unreliable, because they will depend so heavily on the assumptions used 
to prepare the model. But Thomas Lux, an economist at the University 
of Kiel in Germany, offers a more supportive view. He says that the simi- 
larities between financial markets and ecology are real and relevant. For 
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example, the 2008 default of the Lehman Brothers financial services firm 
had contagious effects similar to those of a ‘super-spreader of disease. 
Researchers should go beyond Haldane and May’s simple model, says 
Lux, and build in factors such as interbank credit lines. 

Given such contrasting views, how seriously should policy-makers 
take lessons from nature when it comes to financial regulation? Using 
natural models to set policy has proven difficult, even for ecological pol- 
icy. Witness the controversy that has surrounded the UK government's 
handling of foot-and-mouth outbreaks and whether to tackle bovine 
tuberculosis by culling badgers, which can spread the disease. In other 
areas, such as conservation and fisheries, research offers more clear-cut 
recommendations. But in such cases there is often a failure of political 
will to follow through. Which is the case in finance policy? Perhaps the 
best approach is to keep a cautiously open mind — or to remember that 
the second law of economists states that both the economists cited in 
the first law are wrong. 

Wider discussion of the ideas raised by Haldane and May will inevi- 
tably throw up many questions. Among them, we expect, will be: why 
is Nature publishing research in economics? As a natural-sciences 
journal, most of economics falls outside our primary-research remit, 
and there are some fundamental differences in approach between the 
fields. For example, in financial systems, as Haldane and May point 
out, evolutionary forces often act for the survival of the fattest, rather 
than the fittest. But in this case, we think the clear links that the authors 
draw with ecology will make their study of interest to readers. More 
generally, we are happy to publish primary economic research that 
has significant scientific implications, for exam- 
ple in fields such as behaviour, conservation 
biology, systems biology or physics. With such 
an approach, perhaps both economic science 
and natural science can benefit. m 
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Amazonian Ecuador that is home to the greatest concentration 

of species on the planet? Ecuador’s government has asked for 
US$3.6 billion to refrain from developing the region, in the Yasuni Bio- 
sphere Reserve, but the reaction of the world suggests that the price is too 
high. We need to accept that big oil firms are coming to Yasuni, and the 
struggle now must be how to limit the damage, rather than prevent it. 

The effort to save the region mirrors the plot of 2009’s blockbuster 
film Avatar. Yasunt is the set and its ancestral people are the Waorani. 
The precious commodity beneath their feet is fossil fuel. The region 
and its people are highly threatened by oil extraction and the col- 
lateral damage that such development brings. For most of the past 
half-century, Ecuador’s economy has depended heavily on oil revenue. 
Oil-field development continues to push into more remote areas to 
keep the cash flowing. It has already reached into 
the heart of the Yasuni reserve. 

In an innovative proposal set out in 2007, 
President Rafael Correa offered to sell the world 
a guarantee that oil would not be extracted from 
the Ishpingo-Tambococha-Tiputini (ITT) block, 
a 1,200-square-kilometre portion of Yasuni on the 
country’s eastern border with Peru. The $3.6-billion 
price is about half of what Ecuador could expect to 
make from the roughly 850 million barrels of crude 
oil reckoned to be there. In exchange, Correa said 
that the area would be legally set aside, and become 
off-limits to extraction in perpetuity. In Ecuador, 
the plan is optimistically referred to as Plan A. 

Two decades in Yasuni have infused me with an 
intense desire for this initiative to work. Unfortu- 
nately, after three years of worldwide campaigning, 
it is heartbreaking to say that I don't see it happening. All the 
environmentalists who see the initiative as Yasunfs last chance don't 
want to believe it might fail. Out of fear that we might jinx it, most of 
us haven't even been willing to hint at its possible failure. But the trust 
fund set up to receive the money contains just $100,000 so far. Correa 
says he will evaluate its accumulation rate in a few months. If more 
money does not come, he could withdraw the offer. 

Why the poor response? Some people suggest that his initiative is 
essentially an ugly ultimatum — either the world pays to save Yasuni or 
Ecuador will have no choice but to exploit the oil beneath it. In this sce- 
nario, the Ecuadorian government can blame the rest of the world for 
any negative consequences. Ecuador’s vice-president, Lenin Moreno, 
denies the reserve is being held for ransom, and points out that his 
country would be the greatest contributor, as it is 


H= much is the world willing to pay to protect a piece of 
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& Day of reckoning for 
Ecuador’s biodiversity 


The world’s indifference to a request for $3.6 billion to preserve a diversity 
hot spot may push the country to extract oil there, says Kelly Swing. 


development that would decrease the country’s energy footprint and 
boost conservation efforts. Does the world believe that Ecuador can 
be trusted to use the funds in this way and to keep its promises — not 
only for the next decade but forever? Realistically, no conservation 
plan can demand that any sovereign nation be permanently deprived 
of rights to develop and profit from its own natural resources. 

We have to realize that Plan B, to extract and sell the oil, is ready to be 
set in motion. Historically, such development north of the Napo River 
has led to widespread oil spills and surface dumping of formation water, 
and the opening of access roads has been followed by waves of settlers, 
deforestation, expansion of the agricultural frontier, acculturation of 
indigenous groups, timber harvest and the marketing of bushmeat, 
resulting in near complete destruction of that region. This same process 
has already compromised the accessible western parts of Yasuni. 

If the crude oil in the ITT block and surround- 
ing region is exploited using the same methods, the 
result will be nothing short of a world-class envi- 
ronmental disaster. If an earthquake, hurricane 
or bomb caused the same amount of destruction 
in an instant, no one could ignore the impact, or 
would simply stand by and accept it. Because this 
tremendous loss would be stretched out over dec- 
ades, few are likely to notice the gradual change. 

Does this have to be the future? Oil companies 
say they can do better, if given the chance. The oil 
industry has improved its strategies and technolo- 
gies to minimize risks, but minimizing risks is not 
the same as eliminating them. Just think back on 
BP’s summer of oil in the Gulf of Mexico. 

If extraction is deemed necessary and unavoid- 
able in the ITT (and probably most of the Yasuni 
reserve), any plan must require that the job genuinely be done in an 
environmentally sound way. This can be done only if the oil is extracted 
without new roads, which open the way to so many destructive forces. If 
roads continue to be built to provide access then, without question, all is 
lost. Instead, the oil must be extracted using ‘off-shore’ strategies, which 
are more expensive but do less damage. It goes without saying that any 
operations would have to include proper maintenance and independ- 
ent monitoring, but having isolated oil platforms scattered across the 
landscape has to be better than horizon-to-horizon deforestation. 

Correa’s government has already indicated that it would sacrifice 
revenues of $3.6 billion to save a small portion of this regional biodi- 
versity hot spot. A simple policy of no more roads built in the reserve 
would go a long way to protecting much more of Yasuni, well beyond 
the bounds of the ITT block, and for far less money. m 


Kelly Swing is founding director of the Tiputini Biodiversity Station, 
University of San Francisco de Quito, Ecuador. 
e-mail: kswing@usfq.edu.ec 
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Robo protein guide 
for cell transplants 


A protein used by blood stem 
cells to adhere to bone marrow 
could offer a way to improve 
the success of blood stem-cell 
transplants. 

Certain illnesses, such as 
some blood cancers, have long 
been treated with blood — or 
haematopoietic — stem-cell 
transplants, but the procedure 
is risky. To learn more about 
how to manipulate these 
cells, Camilla Forsberg at 
the University of California, 
Santa Cruz, and her colleagues 
studied the Robo4 protein. 

They found that, in mice, 
haematopoietic stem cells 
lacking Robo4 were fewer 
in number, and less able to 
anchor themselves in the bone 
marrow after transplantation. 
The authors also revealed that 
Robo4 works with another 
protein, Cxcr4, to localize 
transplanted stem cells to the 
bone marrow. 

Cell Stem Cell 8, 72-83 (2011) 


Gas keeps 
drag low 


The flow of water on solid 
surfaces is significantly 
impeded by frictional forces 
— which is bad news for, 

say, marine vehicles. A gas 
layer can be introduced at 

the solid-liquid interface as 

a lubricant, but even slight 
hydraulic pressure can destroy 


Pandas prefer their trees mature 


Giant pandas are famous for their love of 
bamboo, but it seems that they also have a 
preference for old forests, suggesting that 
maintaining such habitats is important for 
conserving this iconic endangered species. 
Fuwen Wei at the Chinese Academy of 
Science in Beijing and his colleagues analysed 
data collected between 1999 and 2003 on 4,908 
plots of land in Sichuan province for signs of 


this layer. Choongyeop Lee 
and Chang-Jin Kim at the 
University of California, Los 
Angeles, have devised a way to 
keep the gas layer intact and 
cut drag even in underwater 
conditions. 

The duo began with a 
highly hydrophobic surface 
studded with 50-micrometre- 
high pillars and gold-coated 
nanostructures (pictured), 
and submerged this in water. 
The gold coating allowed an 
electrolytic reaction to occur, 
generating gas at its surface 
when water made contact. 
Bubbles formed only in areas 
where there had been no gas 
before, and because of the 
surface’s architecture, the 
bubbles spread uniformly 
across the surface. 

Phys. Rev. Lett. 106,014502 (2011) 


268 | NATURE | VOL 469 | 20 JANUARY 2011 
© 2011 Macmillan Publishers Limited. All rights reserved 


Stronger shields 
against the flu 


People’s immune responses 
to infection with the 2009 
HIN1 influenza pandemic 
virus included neutralizing 
antibodies effective against 
many flu virus strains. 

Rafi Ahmed at Emory 
University in Atlanta, Georgia, 
Patrick Wilson at the University 
of Chicago in Illinois and their 
group analysed 86 antibodies 
made by immune cells from 
nine patients infected with 
HIN1. Surprisingly, 63% of the 
antibodies bound to flu viruses 
that had appeared before the 
2009 pandemic. Some even 
worked against the 1918 
pandemic virus and the H5N1 


pandas (Ailuropoda melanoleuca, pictured). 
They found that the presence of old-growing 
forests was just as good an indicator that pandas 
live in the area as the presence of bamboo. 

The authors suggest that maps and models of 
suitable habitats for pandas should be revised 
to prioritize old forests. They also call on the 
Chinese government to renew its ban on logging. 
Biol. Lett. doi:1011098/rsbl.2010.1081 (2010) 


avian influenza virus. Three 
antibodies were generated 
from patients’ cells and tested 
in mice. All three protected the 
animals against a lethal dose of 
HI1N1, whether given before 
or after exposure. Such broadly 
protective antibodies — long 
sought, but elusive — could aid 
the design ofa ‘universal flu 
vaccine, the team speculates. 

J. Exp. Med. doi:10.1084/ 
jem.20101352 (2011) 


The dance of 
three stars 


Researchers have spotted a 
cosmic ballet — two stars 
spinning around one another 
while both orbiting a larger 
star. The trio, named KOI-126, 


DLILLC/CORBIS 
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provided a rare opportunity to 
precisely measure the masses 
of binary stars. 

The triple system is about 
1,000 parsecs from Earth. The 
smaller duo have masses of 
about one-fifth and one- 
quarter that of the Sun, and 
orbit one another in 1.76 days. 
They take about 34 days to go 
around their larger partner, 
which has a mass 1.3 times that 
of the Sun. Joshua Carter of the 
Harvard-Smithsonian Center 
for Astrophysics in Cambridge, 
Massachusetts, and his team 
calculated the masses using the 
space-based Kepler telescope 
by observing the dimming of 
light as the binary stars passed 
in front of the larger star. They 
produced a model that fitted 
the data to determine the 
dynamics and characteristics 
of the low-mass pair. 

Science doi:10.1126/ 
science.1201274 (2011) 


| __NEUROSCIENCE 
Ringing in 
the brain 


People with tinnitus hear a 
ringing even when there is no 
sound. The prevailing theory 
blames a malfunctioning 
auditory system. However, 

it seems that abnormalities 

in the brain’s limbic regions, 
which determine which 
sensations are important and 
how they are experienced, may 
also be involved. 

Josef Rauschecker and his 
colleagues at Georgetown 
University in Washington DC 
used functional magnetic 
resonance imaging to scan 
the brains of 22 volunteers, 
half of whom had tinnitus, 
while they listened to various 
sounds. Patients with tinnitus 
showed heightened activity in 
the nucleus accumbens — a 
key limbic region — when 
presented with sounds that 
matched the frequency of the 
‘ringing’ in their ears. They also 
had anatomical differences in 
the ventromedial prefrontal 
cortex, another limbic area. 

The researchers suggest 
that an abnormal limbic 
system elevates the perceived 
importance of the tinnitus 


sound or fails to suppress it, 
and that interactions between 
the auditory and limbic 
systems may be at the root of 
this disorder. 

Neuron 69, 33-43 (2011) 


Promoter predicts 
drug results 


A clinical trial of a drug to 
treat behavioural features of 
the genetic disorder fragile 
X syndrome has revealed a 
genetic signature that seems 
to predict which patients will 
respond to the treatment. 

Fragile X syndrome results 
from excessive methylation of 
a regulatory section — called 
the promoter — of the FMR1 
gene. This silences the gene 
and increases signalling in the 
pathway for the brain receptor 
mGluRS5. The drug candidate, 
AFQ056, blocks mGluR5 
activation. 

Baltazar Gomez-Mancilla 
at the Novartis Institutes for 
Biomedical Research in Basel, 
Switzerland, and his group 
assessed the drug’s effects on 
the behaviour of 30 men who 
had either a fully or partly 
methylated FMR1 promoter 
region. Seven with a fully 
methylated promoter showed 
behavioural improvements 
after 19-20 days, whereas the 
behaviour of those with only 
partial methylation did not 
differ significantly between 
drug and placebo treatments. 
Sci. Transl. Med. 3, 64ra1 (2011) 


Relativity starts 
your car 


Relativity is generally invoked 
in lofty thought experiments 
often involving fast-moving 
spacecraft, but new work 
shows that it also applies to the 
everyday automobile. 

Cars are started using 
lead-acid batteries, which 
generate energy using 
electrochemical reactions 
between lead compounds and 
sulphuric acid. Rajeev Ahuja of 
Uppsala University in Sweden 
and his colleagues modelled 
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How one elephant became two 


5 HIGHLY READ 
on plosbiology 
org the weeks 
beginning 27 Dec 
and 3 Jan 


Whether savannah- and forest-dwelling 
African elephants belong to the same or 
different species has long been the subject 
of debate. An analysis of the creatures’ 
genetic ancestry confirms that the two 


species are separate. 

Nadin Rohland at Harvard Medical School in Boston, 
Massachusetts, and her co-workers sequenced the nuclear 
genomes of the forest elephant (Loxodonta cyclotis) and the 
savannah one (L. africana), as well as that of the Asian elephant 
(Elephas maximus). They also extracted and sequenced DNA 
from the extinct woolly mammoth (Mammuthus primigenius) 
and American mastodon (Mammut americanum) — ancient 
ancestors of today’s elephants. By comparing all of these 
genomes, the team found that the forest and savannah elephants 
diverged into separate species between 7.1 million and 
1.9 million years ago — much earlier than previously proposed. 


PLoS Biol. 8, e1000564 (2010) 


For more on this research, see go.nature.com/91a4zy 


the reactions and found that as 
electrons move at high speed 
around a lead nucleus, their 
energy levels change owing to 
relativity. The authors conclude 
that the change accounts for 
1.7-1.8 volts ofa standard 
2.13-volt lead-acid cell. 

Phys. Rev. Lett. 106,018301 
(2011) 


The many styles 
of sperm 


Of all the cell types in the 
animal kingdom, the most 
diverse are sperm, which can 
be adorned with tails, hairs, 
bristles and more. It seems 
that sex drives the evolution of 
sperm morphology. 

Lukas Scharer at the 
University of Basel in 
Switzerland and his colleagues 


watched 16 species of 
promiscuous hermaphroditic 
flatworm (Macrostomum), 
which have a variety of sperm 
shapes (pictured), mating 
under a microscope. After 
sex, some species suck out the 
ejaculate, possibly as a way 

of selecting which sperm are 
ultimately accepted. 

The researchers found that 
those species that exhibit this 
sucking behaviour have ornate 
sperm with features such as a 
pair of long bristles emerging 
at the mid-point and a tail 
resembling a paint brush. 
These appendages can become 
lodged in the female orifice 
after copulation, preventing 
the sperm from being sucked 
out. Species that don’t remove 
sperm have evolved simpler 
sperm that tend to be smaller 
and lack hairs or bristles. 

Proc. Natl Acad. Sci. USA 
doi:10.1073/pnas.1013892108 
(2011) 

For a longer story on this 
research, see go.nature. 
com/8jvjws 
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SEVEN DAYS sescnisi 


POLICY 


Haiti’s cholera fight 
Health officials have outlined 
plans for a proposed cholera 
vaccination programme 

in Haiti — but there is still 
tension over how large the 
effort should be. As the 
country struggles to recover 
from the earthquake that 
devastated it last January, 
some experts have pushed for 
asmall pilot project, but the 
government wants a larger 
programme to be rolled out as 
soon as possible. See page 273 
for more. 


NIH conflict clash 
The US National Institutes 
of Health (NIH) should 

issue regulations governing 
conflicts of interest for the 
institutions at which its 
grantees work, a report by 
the inspector general of the 
Department of Health and 
Human Services urges. The 
NIH is revamping rules 

that address the reporting 

of potential conflicts by 
individual investigators but it 
doesn't have analogous rules 
for universities and medical 
centres, even though these are 
required by law, the 10 January 
report notes. The agency says 
it is “carefully considering” 
comments it has received on 
institutional conflicts as it 
finalizes changes to the rules 
for individuals. 


French drug reform 
France's health minister Xavier 
Bertrand last week pledged to 
reform the country’s drug- 
regulation system in the wake 
of a damning official report 
into why the weight-loss 

drug Mediator remained on 
the market for years despite 
concerns over potentially lethal 
side effects. The AFSSAPS, 

the French state body that 
approves drugs for marketing, 
banned the drug only in 2009, 
even though questions were 


Permit for mountaintop mine revoked 


The US Environmental Protection Agency 
(EPA) has for the first time revoked a permit 
for mountaintop mining. Arch Coal, a mining 
company based in St Louis, Missouri, obtained 
the permit for what would have been the 
country’s largest mountaintop mine in 2007 
from the Army Corps of Engineers, which is 
responsible for developing and maintaining US 


raised about its impact on heart 
disease more than a decade 
ago, a delay that the report says 
may have contributed to some 
500 premature deaths. The 
AFSSAPS will now investigate 
76 other products, Bertrand 
says. 


NASA waste 


NASA could end up spending 
US$575 million on a space 
programme that has already 
been cancelled, the agency's 
inspector general has warned 
Congress. The US government 
is currently funded by a 
‘continuing resolution, which 
requires agencies to fund 
existing programmes at last 
year's level until Congress 
passes a new budget. This 
means that NASA has to fund 
Constellation, former president 
George W. Bush's programme 
to return to the Moon and 
reach Mars, at $200 million 
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a month until 4 March. 
Congress passed legislation 
to cancel the programme last 
October. See go.nature.com/ 
pshdef for more. 


} RESEARCH 
Hunger still rife 


Entire segments of the food 
production system have 

been damagingly neglected 

in international attempts to 
reduce hunger and poverty, 
according to a report from 
the Worldwatch Institute, 

an independent research 
organization based in 
Washington DC. The ‘State of 
the World’ report, published 
last week, says that previous 
approaches to feeding the 
world’s population have “not 
really worked”, given that 

925 million people worldwide 
still go hungry every day. 
Worldwatch calls for new 


water resources (see Nature 467, 1021; 2010). 
But the EPA has ruled that Spruce 1 Mine in 

the Appalachian Mountains of West Virginia 
presented major environmental and water- 
quality concerns, and would jeopardize the 
health of local communities. US mountaintop 
mines, such as the one on Kayford Mountain 
(pictured), are largely found in the Appalachians. 


approaches, such as growing 
a wider variety of crops and 
reducing reliance on chemical 
fertilizers. See go.nature.com/ 
bbkrbb for more. 


Climate records 


Last year was one of the two 
warmest years on record, 
according to the US National 
Oceanic and Atmospheric 
Administration (NOAA). Only 


NUMBER CRUNCH 


18% 


The number of health 
advocacy groups receiving 
funding from Eli Lilly in 
the first half of 2007 that 
acknowledged it in their 
annual reports. 

Source: Am. J. Public Health 
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2005 had an equivalent average 
global surface temperature. 
Global average temperatures 
for 2010 — which was also 

the wettest year on record — 
ran 0.62 °C higher than the 
twentieth-century average, 
according to a preliminary 
analysis from NOAA’ National 
Climatic Data Center in 
Asheville, North Carolina. 


Spill science scarce 


The presidential commission 
investigating last year’s huge 
oil spill in the Gulf of Mexico 
has called for more science 

in federal decisions on oil 
production and spill response. 
In its final report, released last 
week, the commission asked 
Congress to supply more 
funding for scientific and 
environmental studies and to 
involve science agencies more 
formally in decisions about 
which areas should be opened 
to exploration. Commission 
co-chair Bob Graham said: 
“Science has not been given 

a sufficient seat at the table. 
Actually, I think that’s a 
considerable understatement. 
It has been virtually shut out.’ 
See go.nature.com/skf5zx 

for more. 


After artemisinin 
At least US$175 million is 
needed to halt the spread 
of malaria parasites that are 
resistant to artemisinins, 
says the World Health 
Organization (WHO). Of 


TREND WATCH 


Even without the more expensive 
treatments for cancer that are to be 
adopted soon, the cost of caring for 
those with the disease will rapidly 


increase in the coming years. A 


team from the US National Cancer 


Institute in Bethesda, Maryland, 
modelled predicted changes to US 
population, cancer incidence and 
survival rates for the initial, final 
and continuing care stages of the 
disease and found that spending 
could rise by more than 20% by 
2020. Even a small cost increase 
on top of that will drive numbers 
much higher. 


this, some $60 million should 
be used to boost research 
activities including developing 
classes of antimalarials to 
replace the artemisinins, 
currently the most potent 
antimalaria drugs. Published 
last week, the WHO’ plan for 
containing resistance to the 
drugs also calls for increased 
monitoring, as only 31 of the 
75 countries that should be 
routinely testing the drugs’ 
efficacy did so in 2010. 


King Faisal prize 
Chemists George Whitesides, 
of Harvard University in 
Cambridge, Massachusetts, 
and Richard Zare, of Stanford 
University in California, have 
been announced as winners 

of this year’s King Faisal 
International Prize for Science. 
James Thomson, of the 
University of Bern, and Shinya 
Yamanaka, of the University of 
California, San Francisco, and 
Kyoto University, Japan, took 
the prize for medicine for their 
work on stem cells. Winners 
receive a medal and share 
US$200,000 in each category. 


ERC rings changes 
The European Commission 
has appointed seven members 
to the governing body of the 
European Research Council 
(ERC). The appointments 

to the 22-member Scientific 
Council will begin their 


2-year terms on 2 February. 
Fotis Kafatos, a molecular 
entomologist at Imperial 
College London who was the 
first president of the ERC, 

is among those departing. 
Arrivals include Nobel 
laureate Timothy Hunt 
(pictured), a biologist at the 
London Research Institute of 
Cancer Research UK. 


| BUSINESS 
BP in Russian deal 


BP is joining up with Russia's 
state oil company Rosneft to 
drill in the Arctic waters of 
the Kara Sea. The London- 
based company will take 9.5% 
of Rosneft’s shares and help 
its Moscow-based partner 
explore a 125,000-square- 
kilometre area that it 
compares in potential to the 
North Sea. Rosneft will get 
5% of BP’s ordinary shares, 
worth around US$8 billion. 
Environmentalists have 
protested against the deal as 
they grow increasingly vocal 
about the dangers of drilling 
in the Arctic. 


CANCER COSTS WILL KEEP CLIMBING 

A 2% increase per year in the cost of initial and end-of-life care could 
drive total cancer costs in the United States to US$173 billion by 2020, 
a 39% increase. Initial care costs for three cancers are shown here. 
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SEVEN DAYS | THIS WEEK | 


23-28 JANUARY 

The fifth annual Arctic 
Frontiers conference 
takes place in Tromsg, 
Norway. Policy experts 
and scientists will tackle 
the topic of Arctic 
tipping points. 
www.arctic-frontiers.com 


24-28 JANUARY 
Ata workshop in 
Chamonix, France, 
scientists at CERN 
will discuss whether to 
extend the run of the 
Large Hadron Collider 
to the end of 2012 in 
search of the Higgs 
boson. 
go.nature.com/udrnvx 


24 JAN-4 FEB 

The ninth session of 
the United Nations 
Forum on Forests — 
which officially kicks 
off the International 
Year of Forests, 2011 — 
takes place at the 

UN headquarters in 
New York. 
go.nature.com/potx5i 


Satellite sacking 


The chief executive of a 
leading German space 
company has been suspended 
as from 17 January for 
allegedly criticizing the 
European satellite-navigation 
system Galileo, which the 
company is helping to build. 
In a statement, OHB-System, 
of Bremen, said it saw “no 
alternative” to removing Berry 
Smutny “in order to effectively 
avert any further damage to 
the company”. Cables from 
the US Embassy in Berlin, 
obtained by WikiLeaks and 
publicized in the Norwegian 
newspaper Aftenposten, claim 
that Smutny said the Galileo 
project is a “stupid idea” and 

a “waste of money”. Smutny 
denies saying this. 


> NATURE.COM 
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NEWS IN FOCUS 


EARTH SCIENCE Antarctic drillers STEMCELLS Test regime for SPACE Fault holds up 
inch closer to breaking drugs sees disease recreated gravitational- wave 
through to lost lake p.275 ina dish p.279 mission p.280 


VISION There’s more to 
the eye than rods and 
cones p.284 


About 200,000 people in Haiti have been sickened by cholera since the outbreak began in October. 


PUBLIC HEALTH 


Cholera vaccine 
plan splits experts 


Opinion is divided over how to tackle the disease in Haiti. 


BY DAVID CYRANOSKI, PORT-AU-PRINCE 


arely heard in Haiti before October, 
Rives is now an insult that children 
fling at one another in the teeming 
camps that still house more than a million 
people displaced by last January's devastat- 
ing earthquake. Graffiti blames the disease 
on either the current administration — now 
in a contested election crisis — or the United 
Nations. The disease is as much a fixture in 
people's lives as the endless piles of rubble that 
remain uncleared a year after the quake. 
Last week, as the country remembered the 
230,000 people killed in the disaster, officials of 
international health agencies fine-tuned their 


recommendations for moving forwards with 
a large-scale cholera-vaccination programme. 
It is a controversial idea that, just months ago, 
with little vaccine available and the epidemic 
spreading rapidly, was shunned as impracti- 
cal and probably ineffectual (see Nature 468, 
483-484; 2010). Now, with emergency care 
centres in place, at least in the most heavily 
populated areas, health officials can finally 
look ahead and think about how a vaccination 
programme might combat a disease that has 
become entrenched in the country. 
However, Nature’s interviews with key 
partners in the proposed vaccination effort 
reveal significant disputes on how to proceed. 
Most experts in the international community 


recommend a limited pilot project that would 
determine whether to scale up and how to use 
cholera vaccines in future outbreaks elsewhere. 
The Haitian government, caught in a febrile 
political environment and fearful that those 
denied vaccination might feel resentful, is 
demanding immediate, broad coverage. 

With no recent exposure to cholera, Haiti’s 
population lacks natural immunity and the 
disease has spread quickly. Roughly 3,800 have 
died, with another 189,000 falling ill, since 
21 October, when cholera was first recognized 
as the culprit. At the end of October, a local 
medical aid agency, GHESKIO, supported 
by the UN Children’s Fund (UNICEF), pro- 
posed vaccinating children under five living 
in two slums that have not yet reported large 
outbreaks. “There are 200,000 people without 
any toilets. They collect it and dump it in the 
sea, says Jean-Claude Mubalama, UNICEF's 
chief of health in Haiti for the past five years. 
“Tf cholera arrives there, it will be very bad” 

The Haitian ministry of health (MSPP) 
and the World Health Organization (WHO) 
rejected the proposal, pointing out that not 
enough vaccine was available. They also feared 
that vaccination would foster a false sense of 
security, causing people to relax sanitary meas- 
ures; and that it would take resources away 
from treating the sick, or from vaccine drives 
against measles and other diseases. “The voice 
of reason was to focus on saving lives,” says Jon 
Andrus, deputy director of the Pan American 
Health Organization (PAHO), the WHO's 
regional office. “I had driven around Port-au- 
Prince and seen dead bodies in the street?” 

In December, however, an expert commit- 
tee convened by the WHO decided that vac- 
cination should be tried, partly because they 
had located extra sources of the only WHO- 
approved vaccine, Dukoral, an expensive two- 
dose vaccine made by Crucell, based in Leiden, 
the Netherlands. On 13 January, the expert 
committee, including representatives from 
the WHO, the US Centers for Disease Control 
and Prevention, the US National Institutes of 
Health (NIH), UNICEE the US National Vac- 
cine Program Office and others, held a tel- 
econference to fine-tune a vaccination plan 
that could form the basis of a more detailed 
WHO-coordinated campaign strategy. The 
committee is recommending a pilot project 
using the currently available 250,000-300,000 
doses of Dukoral, and the creation of a stock- 
pile of the vaccine for the future. > 
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| NEWS IN FOCUS 


> The vaccination effort “can’t be done 
nationwide and it won't have a major public- 
health impact’, says Andrus, but it could reveal 
just how effective the vaccine would be ina 
mass immunization of a population already 
widely affected by cholera. Dukoral has not 
been used on such a scale before, although 
studies of thousands of people have shown it 
to be about 80% effec- 


tive. The committee has “The 7 

not worked out where bacterium 
the campaign would be won't go 
focused. “You can find away. It has 
areas where cholera is established 
endemic, and that may _ itself.” 


give youa targeted popu- 

lation where it may have a larger impact,” sug- 
gests Médecins Sans Frontiéres epidemiologist 
Kate Alberti. 

The campaign could also help to reach the 
country’s remote rural populations, which have 
a higher mortality rate. Although vaccine drives 
in Africa and elsewhere have faced resistance, 
Haitian people are eager to be vaccinated, says 
Francois Lacapére, a vaccine expert for PAHO/ 
WHO in Haiti. Yet many Haitians are also scep- 
tical of aid agencies’ motives. Suggestions that 
foreigners accidentally introduced the disease 
(see “How did the outbreak begin?’) have given 
rise to unfounded rumours. Some people liv- 
ing in a camp that was once the Petionville golf 
course in Port-au-Prince, for example, make 
completely unsubstantiated claims that they 
have seen UN staff poisoning reservoirs in an 
attempt to further debilitate Haiti so that inter- 
national powers can take over. 


In October 2010, rumours quickly spread 
in the Haitian press that a Nepalese United 
Nations peacekeeping base was to blame for 
bringing cholera to the country. An Associated 
Press report of excrement from the base 
being dumped in the Artibonite river fuelled 
the controversy. On 1 November, the US 
Centers for Disease Control and Prevention 
(CDC) in Atlanta, Georgia, reported genetic 
analyses suggesting the strain was from 
southeast Asia, not the one more commonly 
seen in Latin America. On 7 December, 
epidemiologist Renaud Piarroux of the 
University of the Mediterranean in Marseilles, 
France, concluded in a report for the French 
foreign ministry that the cases originated 
from near the Nepalese peacekeeping 
base. “Il am certain that it started only a few 
days after a battalion of soldiers came from 
Katmandu, a city which was subjected to a 
cholera outbreak at that time,” he says. 

On 9 December, a more detailed genetic 


Even if the programme can win enough 
trust, using the world’s entire stockpile of doses 
would still leave most Haitians without vac- 
cine — a controversial prospect for the belea- 
guered government. Jean Ronald Cadet, the 
MSPP’s vaccination programme manager, says 
the country is “90%” ready to go ahead with 
a campaign — but not on the small scale the 
WHO-convened expert group envisages. 

Asked about the small pilot project proposed 
by the group, Cadet says “No way,’ shaking his 
head. He insists that Haiti would only consider 
starting to vaccinate with more than 1 million 
doses, with a goal of eventually reaching 6 mil- 
lion people. “It would depend on the pressure 
that the international community can put on 
manufacturers.” Who would pay for the doses? 
“The international community,’ he says. “They 
brought us cholera, they have to take responsi- 
bility for taking care of it.” 

But mass vaccination of millions of people 
would necessitate much more vaccine produc- 
tion. About 1 million doses exist of another 
vaccine, Shanchol, which might be approved 
for use by the WHO by March (it is already 
approved for use in India). If production of 
both vaccines went into overdrive, Lacapére 
estimates that about 5 million doses could be 
prepared annually. This availability would be 
dependent on an advance-purchase decision, 
and with a six-month lag time to delivery. 

Epidemiologist Renaud Piarroux of the 
University of the Mediterranean in Marseilles, 
France, says that if vaccination is going to be 
tried, it should be done on a large scale. “I 
think it can be helpful, but it should be given to 


CHOLERA SOURCE 


How did the outbreak begin? 


analysis, led by Matthew Waldor of Harvard 
Medical School in Boston, Massachusetts, 
upheld the CDC’s conclusions (C. S. Chin 

et al. N. Engl. J. Med. 364, 33-42; 2011). The 
analysis, based on samples from Haiti and 
strains from Latin America and Bangladesh, 
cannot definitively confirm a Nepalese 
source, says Waldor. The strain could have 
arrived with other south Asians, or from west 
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millions of people in order to expect a notable 
effect,” he says. But he doesn't see a large cam- 
paign as practical. “This will cost a lot and will 
require time to get a sufficient number of doses. 
I would prefer this money be used to improve 
water-supply networks and to reinforce sani- 
tation activities,” he adds. In an unpublished 
paper, Piarroux presents data ona large cholera 
outbreak in Darfur, Sudan, that happened just 
two years after a mass-vaccination programme, 
suggesting that any coverage might be of lim- 
ited duration. 

Others hope for a more aggressive approach. 
Matthew Waldor, an infectious-disease expert at 
Harvard Medical School in Boston, Massachu- 
setts, says public-health officials should consider 
trying Peru-15, a live attenuated vaccine being 
developed by a consortium including Harvard 
Medical School and the NIH. Peru-15 is not yet 
in phase III trials, but has been proved safe and 
effective in thousands of patients, Waldor says. 

Whatever approach is tried, one thing is 
certain: cholera is there to stay. It is likely that 
the bacteria now have a stronghold in Haiti’s 
water, says Alberti. “Then you have a con- 
stant transmission between humans and the 
aquatic environment.” With poor sanitation, 
little access to clean water and difficulties in 
reaching people to treat them — not least due 
to gang warfare in the slums — the country 

can expect repeated out- 


NATURE.COM breaks, Alberti says. 
Read more about Andrus agrees: “The 
Haiti's cholera bacterium won't go 
treatment centres: away. It has established 
go.nature.com/tyltvu itself” m 


Africa. “But the Nepalese hypothesis is fairly 
convincing,’ he says. 

The UN initially questioned whether it 
was important to pin down the origin of the 
cholera. But on 6 January, it appointed a 
committee of four scientists to investigate 
the claims. Waldor says that a definitive 
answer could come from sequencing 
about 20 strains — from Haiti, Nepal and 
West Africa, as well as more recent strains 
from Latin America and other south Asian 
countries — and by checking blood samples 
from Nepalese troops in Haiti for antibodies. 
“The truth is important,’ Waldor says, 
because it could help to understand the risk 
of cholera transmission from peacekeepers 
in the future. 

Piarroux says that the international 
community’s responsibility for the epidemic 
should fuel efforts to eliminate the disease. 
“We are responsible for importing cholera — 
we have to fight against it with an iron will.” D.C. 


ANTARCTIC RESEARCH 


IN FOCUS | NEWS 


Race against time for 
raiders of the lost lake 


Arguably the most exciting — and certainly the most controversial — scientific endeavour 
in Antarctica’s history is close to a breakthrough. 


BY QUIRIN SCHIERMEIER 


Russian drilling team is just metres 
A= from reaching the water surface 

of Lake Vostok, the largest and deepest 
of the freshwater lakes hidden beneath Antarc- 
tica’s massive ice sheet. 

The ambitious project, launched more than 
20 years ago, has been repeatedly delayed by 
technical glitches and funding problems (see 
Nature 464, 472-473; 2010). But Russian 
researchers, who on 2 January resumed drill- 
ing at a depth of 3,650 metres, believe that just 
20-40 metres or so of accretion ice — frozen 
lake water — now separate them from the 
lake’s liquid surface. “We can make it this time,” 
Valery Lukin, director of the Russian Antarctic 
programme, told Nature. 

But time is short. Although the drill can 
advance by about 3 metres each day, the team 
must call a halt by 6 February, when the last 
aircraft of the summer research season is 
due to leave the Vostok research station, 
about 1,300 kilometres from the South Pole 
(see ‘Drill for victory’). If they haven't reached 
the lake by then, they will have to wait until 
December to continue, Lukin says. 

The chance of sampling one of the last 
uncharted environments on Earth has excited 
researchers ever since the lake's existence was 
first mooted in the 1970s. Many are thrilled 
by the possibility of discovering evidence of 
unique life forms in the lake, which is thought 
to have formed as much as 35 million years 
ago. But others worry 
that the drilling effort 
could contaminate an 
untouched environ- 
ment. The lake may 
hold traces of ancient microorganisms that 
could reveal how life on Earth has adapted to 
extreme conditions. 

At the Vostok station, tension is rising with 
every passing day. The team hopes that a sen- 
sor attached to the drill head will signal contact 
with liquid water in the next few weeks. At that 
point, the drill will be stopped and extracted 
from the bore hole, thereby lowering the pres- 
sure beneath it and drawing water into the hole. 
This should prevent any of the silicone drill- 
ing lubricant from entering the lake, explains 


“This is as new 
and exciting as 
flying to Mars.” 


DRILL FOR VICTORY 


After boring through almost 4 kilometres of ice, 


researchers are on the verge of reaching Lake Vostok. i 
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Lukin. The rising water will rapidly freeze in 
the borehole, where drillers can extract it with- 
out penetrating the pristine lake. “If everything 
goes according to plan, we will re-core the hole 
in December and retrieve the frozen sample 
without polluting the lake water,” he says. 

The plan cleared a key hurdle last Novem- 
ber, when scientists with Russias Arctic 
and Antarctic Research Institute (AARI) in 
St Petersburg submitted a final environmen- 
tal evaluation of the project, approved by the 
Russian government, to the Antarctic Treaty’s 
environmental protection committee. The 
document addresses queries or objections 
previously raised by parties to the treaty, allow- 
ing sampling operations to begin 60 days after 
the final evaluation was circulated to them. 
Scientists contacted by Nature acknowledge 
Russia’s right to proceed as planned, but remain 
unconvinced that the sampling technology is 
as clean as is claimed. “From our experience 
there is no such thing as clean drilling,” says 
Jean Robert Petit, a glaciologist at the Labora- 
tory of Glaciology and Environmental Geo- 
physics (LGGE) near Grenoble in France. 

Lake Vostok has been totally isolated for 
almost 15 million years, and researchers sus- 
pect that it is virtually devoid of nutrients and 
organic carbon. Its chemistry, together with the 
cold, darkness and high water pressure, could 
mean that conditions there resemble those in 
the suspected ice-covered ocean on Jupiter's 
moon Europa. Many think that Lake Vostok’s 
water is unlikely to support life today, but that 
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the sediment or bedrock beneath might host 
microorganisms. That would feed hopes that 
something similar could be found on Europa. 

Sediments found in accretion ice extracted in 
previous years by a team from the LGGE con- 
tained the thermophilic bacterium Hydrogen- 
ophilus thermoluteolus (S. A. Bulat et al. Adv. 
Space Res. doi:10.1016/j.asr.2010.11.024; 2010), 
although this does not prove that there is life in 
the lake itself. 

If the Russians reach their goal, lake water 
samples will be analysed for genetic material 
at the AARI, where a state-of-the-art DNA 
sampling laboratory opened last November, 
says Lukin. The credibility of any findings will 
require meticulously documented decontami- 
nation procedures, says Martin Siegert, an Ant- 
arctic researcher at the University of Edinburgh, 
UK. “Scientists will rightly ask how contamina- 
tion has been avoided, for example during the 
ice core’s long journey up the borehole,’ he says. 
Lukin says that any traces of life found will be 
sent to foreign labs for independent verifica- 
tion. “We are prepared to do this properly.” 

Meanwhile, Russian scientists and engineers 
are laying plans to venture into the lake itself. In 
the Antarctic summer of 2012-13, they plan to 
send a swimming robot into the lake to collect 
water samples and sediments from the bottom. 
An environmental assessment of the plan will 
be submitted at the Antarctic Treaty’s consulta- 
tive meeting in May 2012. “We'd like to pursue 
this by all means,” says Lukin. “For us, this is as 
new and exciting as flying to Mars.” m 
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A facility at Shidongkou No. 2 Power Plant in Shanghai, China, scrubs carbon dioxide from flue gases for a cost per tonne of CO, that is far below prices elsewhere. 


Low-cost carbon-capture 
project sparks interest 


Consortium to determine whether price reductions seen in China can be applied abroad. 


BY JEFF TOLLEFSON 


he Shidongkou No. 2 Power Plant outside 
Shanghai, China, has hosted a parade of 


foreign visitors in recent months, from 
academics and industry officials to US energy 
secretary Steven Chu. All have had one question 
on their minds: have Chinese engineers turned 
a corner on carbon-capture technology? 

That question occupies a small but signifi- 
cant place in a package of clean-energy research 
initiatives expected to be announced this week 
as Chinese President Hu Jintao meets US Presi- 
dent Barack Obama in Washington DC from 
19 January. 

The buzz began in late 2009, after officials 
at the government-owned Huaneng Group 
opened a facility that captures some of the 
carbon dioxide emitted by the existing giant 
1,320-megawatt coal-fired Shidongkou power 
station. The system scrubs roughly 120,000 
tonnes of CO, a year from 3% of the facility's 
flue gases, but what has caught everybody’s 
eye is the cost that Huaneng quotes: a mere 
US$30-35 per tonne of CO,, including the fur- 
ther expense of purifying the captured gas for 
use in the food and beverage industry. 

That is far below the $100 or more typically 
estimated for first-generation projects to retro- 
fit existing power plants for carbon capture and 


storage (CCS) in the United States and Europe, 
and it is within the range of past carbon prices 
in the European Union emissions trading sys- 
tem. If similar cost reductions can be realized 
elsewhere, they could cut years off the timeta- 
ble for commercial introduction of retrofitted 
CCS technology, touted as a way to reduce the 
climatic impact of existing coal plants. Experts 
want to know how the Chinese facility is doing 
it, and whether the savings could be exported. 
During Hu’s US visit this week, officials are 
expected to announce an initiative that will see a 
consortium of government, academic and busi- 
ness interests from the United States and China 
conduct a coal-technology assessment that 
could provide some answers. “A lot of people 
want to know whether that work will translate 
into other markets, and I believe we'll be able to 
shed a lot of light on that question,’ says Julio 
Friedmann, carbon-management programme 
leader at Lawrence Livermore National Labora- 
tory in Livermore, California. Friedmann also 
serves as technical director for the US-China 
Advanced Coal Technology Consortium at West 
Virginia University in 
Morgantown, which was 


established last Septem- Read more 
ber as part ofan energy aboutcarbon 
partnership and will — sequestrationat: 


conduct the assessment. 
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The Shidongkou retrofit builds on the work 
of asmaller facility, installed at the Gaobeidian 
power plant in Beijing in 2008 (see Nature 454, 
388-392; 2008). Both installations use a com- 
mon CCS technology: CO,-rich flue gas from 
the plant is bubbled through a column contain- 
ing an amine-based solvent — in this case, a 
mixture of ethanolamine and additives — that 
reacts with the gas and takes up its carbon 
dioxide. The solvent is then heated to release 
the CO,, and the whole process starts again. In 
addition to the direct costs, this process is nor- 
mally expected to gulp 25% or more ofa plant's 
energy output when fully scaled up, making it 
a tough sell for power companies. 

Huaneng has not yet revealed all the tech- 
nical details of its CCS process. Huang Bin, 
head of Huaneng’s Research and Development 
Division in Beijing, says that the company has 
made unspecified changes in the design of the 
plant and the chemistry of the solvent, which 
increased the energy efficiency of the system by 
11-14% and reduced the cost of installation by 
a factor of 10 per tonne of CO,. 

“Huaneng is using a known process, but they 
seem to have found a way to do it much more 
economically,’ says David Mohler, chief tech- 
nology officer of Duke Energy, an electric utility 
company based in Charlotte, North Carolina. 
“What's the secret sauce?” he adds. 


HUANENG GROUP 


The cost that Huaneng quotes for cap- 
turing the carbon dioxide before purifica- 
tion — around $20 per tonne — is four to 
five times lower than anything anybody else 
is reporting, says Mohler. Duke Energy has 
a partnership with Huaneng and has been 
planning to analyse the cost of installing 
and running Huaneng’s technology at its 
Gibson Station power plant in Indiana; 
Mohler says that joining the consortium’s 
broader assessment will bolster the Gibson 
analysis and help put this technology in 
context with other options. 

However, Howard Herzog, a chemical 
engineer researching carbon sequestration 
at the Massachusetts Institute of Technology 
in Cambridge, says a deeper inspection of 
the Shidongkou facility might reveal that its 
secret comes down to things such as cheap 
labour and fewer regulatory burdens. “The 
fact that it’s cheaper in China doesnt impress 
me,’ says Herzog, who recently toured the 
facility. “Everything is cheaper in China.” 

Sarah Forbes, a carbon-sequestration 
expert at the World Resources Institute, an 
environmental think tank in Washington DC, 
says that Huaneng’s costs for carbon capture 
are in proportion with general costs in the 
Chinese coal industry, which tend to be about 
one-third of those in the United States. 

Ming Sung, who promotes US-Chinese 
business alliances in Beijing on behalf of the 
Clean Air Task Force in Boston, Massachu- 
setts, acknowledges that costs will be higher 
in places such as the United States, but says 
that the question is how much. Assuming that 
Huaneng has made some industrial progress, 
he says, its technology will probably still be 
cheaper than anyone else’s at this stage. 

That may not be enough to jump-start the 
technology, Forbes cautions. “Adding carbon 
capture to a power plant still comes with a 
significant cost and energy penalty” that will 
discourage adoption, she says, and the regu- 
latory drivers necessary to encourage or force 
take-up are still missing. 

Nonetheless, EmberClear, an energy com- 
pany based in Calgary, Canada, has licensed 
a suite of Huaneng’s clean-coal technologies 
— including the capture technology in place 
in Shanghai — for deployment in the West. 
Albert Lin, chief executive of EmberClear, 
says the cost reductions that Huaneng has 
achieved at Shidongkou are probably the 
result of a mixture of practice and Chinese 
economics. “Half of it is Chinese costs, but 
there are definitely improvements in how the 
technology is being used,” he says. 

To Friedmann, Shidongkou No. 2 has 
become a critical case study for exploring 
actual CCS costs, because of its scale and the 
real operational data that Huaneng is provid- 
ing. “There are not that many places where 
we are going to have the kind of data we are 
looking for,” says Friedmann. “They are the 
forerunners.” = 


IN FOCUS | NEWS 


France mulls embryo 
research reform 


Scientists and clinicians push for a clearer, more permissive 
law on human embryonic stem-cell work. 


BY DECLAN BUTLER 


rance’s tight restrictions on human 
eentsent stem-cell (ESC) research 

might soon be eased. As parliament 
prepares to debate the issue next month, 
Nature has learned that researchers calls for 
reform are garnering political support. 

Officially, research on human ESCs and 
embryos is banned in France. But under a 
2004 amendment to the country’s bioethics 
law, scientists can obtain dispensation for 
research that could lead to “major therapeu- 
tic progress” for serious diseases that resist 
other approaches. Those whose research fits 
the bill — about 30 research groups and 40 
projects so far — can carry out research on 
whole embryos, or on cell lines derived from 
embryos left over from in vitro fertilization 
(IVF). Creating embryos for research pur- 
poses is illegal in the country, a position that 
enjoys a broad consensus among scientists, 
politicians and public alike. 

Scientists concede that the 2004 compro- 
mise has been a great improvement on the 
previous outright ban. But it still left uncer- 
tainty over the regulatory status of ESCs in 
France. This uncertainty is a deterrent to 
foreign researchers and investment by com- 
panies, says Marc Peschanski, a neuroscientist 
working for INSERM, the national biomedical 
research agency, and head of the Institute for 
Stem Cell Therapy and Exploration of Mono- 
genic Diseases in Evry, outside Paris. Axel 
Kahn, a renowned INSERM geneticist and 
president of the University of Paris- Descartes, 
calls the current law “an intellectual absurdity 
anda legal quirk”. 

A broad consensus of researchers and 
clinicians is now urging the government to 
overturn the ban, and to explicitly authorize 
research on ESCs and whole embryos without 
the need for any special dispensation. 

The law is ripe for reform, says Philippe 
Menasché, a cardiovascular surgeon work- 
ing for INSERM at the Georges Pompidou 
European Hospital in Paris, where his team 
is researching stem cells as a potential therapy 
for heart disorders. Conservative politicians 
opposition to all forms of embryo research 
was so fierce in 2004 that most researchers 
were grateful for any progress, he recalls, but 


political and ideological resistance has now 
largely abated. 

The government’ draft revised bill, released 
last October, would maintain the existing 
system. But last week, a source close to the 
science ministry told Nature that the minis- 
try will back the explicit authorization of ESC 
research. After five years without any apparent 
abuses, politicians are more comfortable with 
the work, he says. The successful oversight of 
the national Biomedicine Agency, set up in 
2004 to regulate human embryology, genet- 
ics and IVE has had a critical role in inspiring 
confidence, he adds. 

The science ministry is holding off from 
taking a position on the more controversial 
issue of research on whole embryos, says the 
ministry source, preferring to wait until it has 
heard the views of parliament next month. 

Some scientists, including Menasché and 
Peschanski, say they would be satisfied with 

a more permissive law 


After five on only ESCs, a posi- 
years without tion that has support 
any apparent among parliamentar- 
abuses, ians, adds Menasché. 
politicians But other scientists, 
are more including Kahn, point 
comfortable out that ESCs are just 
with the work. one aspect of embryo 


research, and they are 
pushing for a more complete authorization 
that includes all forms of such research. This 
view was endorsed by the influential bipartisan 
Parliamentary Office for Evaluation of Scien- 
tific and Technological Options last July, and 
by the Conseil d’Etat, one of the three arms 
of the country’s supreme court system, in a 
May 2009 report requested by Prime Minister 
Francois Fillon. 

The outcome of next month’s debate is 
“unpredictable”, says Menasché. The govern- 
ment may decide to take the low-risk approach 
of maintaining the status quo, and any 
bill must also be approved by the highly 
conservative Senate. A key factor will be the 
recommendations ofa cross-party parliamen- 
tary commission set up to examine the draft 
bill. Having heard testimony from researchers, 
it is scheduled to release its report later this 
month. A source says that the commission is 
currently “very divided”. = 
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Cells snag top modelling job 


Heart disorder joins growing list of conditions getting the “disease in a dish’ treatment. 


BY EWEN CALLAWAY 


hen an unconscious 28-year-old 
woman with a rare heart disorder 
was rushed to hospital, surgeons 


saved her life by implanting a defibrillator. In 
time, a sample of her cells, alive and beating in 
a dish, could help to save other lives. 

This is the hope for patient-specific models 
of disease, which can be created by reprogram- 
ming the patient's cells in the lab into an undif- 
ferentiated stem-cell state and then converting 
them into the specialized cell types affected by 
the condition. A handful of such models have 
been created so far for diseases ranging from 
diabetes to rare neurodegenerative diseases 
(see ‘Dishy models’), with many more expected 
soon. Pharmaceutical companies are beginning 
to use these cells to identify effective drug treat- 
ments and to predict side effects that may only 
appear in a small subset of patients. 

“The patient's cells act exactly as the patient 
was acting when admitted to the hospital,” 
says Lior Gepstein, a stem-cell biologist at 
the Technion Israel Institute of Technology 
in Haifa, Israel. His team created induced 
pluripotent stem (iPS) cells from skin cells 
donated by the young woman, who suffers 
from a genetic form of long QT syndrome. 
They then reprogrammed the cells into heart- 
muscle cells called myocardiocytes, which can 
be used to study the disease. Their work was 
published in Nature’ on 16 January. 

Long QT syndrome — which gets its name 
from a telltale anomaly in a patient's electro- 
cardiogram — is caused by delayed electrical 
recharging in heart-muscle cells. The condi- 
tion typically stems from inherited mutations in 
molecular channels that pump ions in and out 
of cells. People with long QT often die suddenly 
from arrhythmias, a kind of irregular heartbeat, 


DISHY MODELS 
Induced pluripotent stem cells are being used to 
mimic a growing number of diseases. 


Condition Cells of interest 
Amyotrophic lateral | Motor neurons? 
sclerosis 

Spinal muscular Motor neurons* 

atrophy 

Familial eural crest precursors” 
dysautonomia 

Long QT Myocardiocytes!* 
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Fanconi anaemia Haematopoetic progenitors 


Rett syndrome eural precursors, neurons’ 


in their 20s and 30s. No good animal models 
of the condition exist, says Gepstein, because 
rodent hearts beat many times faster than the 
human heart and use different ion channels. 

Heart-muscle cells created from the patient’s 
iPS cells, however, recreated several aspects 
of her disease in a Petri dish. The cells didn't 
recharge as quickly as heart-muscle cells made 
from ‘healthy’ iPS cells, for example. An earlier 
patient-specific model of a different form of 
long QT syndrome found similar results”. Both 
the ‘healthy’ and the long QT patient-derived 
heart-muscle cells formed cell sheets in the Petri 
dish that developed a rudimentary ‘heart beat. 
But the patient’s cells pulsed irregularly. “The 
normal cells ofa healthy individual went boom- 
boom-boonm; of the long QT patient they 
went boom-boom-boomboomboomboom, 
Gepstein says, doubling his tempo. 

Three medicines that help heart cells recharge 
by targeting different kinds of ion channels all 
prevented arrhythmias in the diseased cells, 
proving the model's potential for drug screening. 
Moreover, the long QT patient's cells beat more 
irregularly when Gepstein’s team treated them 
with a gastric-acting drug, cisapride, which 
was pulled from the market in 2000 because it 
caused lethal arrhythmia in some patients. 

Predicting such side effects may be the first 
application for cells from long QT patients, says 
Gepstein. But he thinks that the eventual goal 
of patient-specific disease models should be 
personalized medicine. Clive Svendsen, direc- 
tor of the Cedars-Sinai Regenerative Medicine 
Institute in Los Angeles, agrees: “You could 
prescribe the best drug for a patient without 
rotating through different drugs and without 
the danger of multiple side effects.” 

Christine Mummery, a stem-cell biologist 
at the University of Leiden in the Netherlands, 
disagrees, saying that making personalized dis- 
ease cells is too time-consuming and expensive 
to treat individual patients. But testing drugs 
against disease cells from numerous patients 
and comparing the cells’ genetics with their 
responses could reveal markers that could guide 
treatment more generally, she says. 

First, though, scientists need to overcome 
a shortcoming of the disease-in-the dish 
strategy: many patient-specific iPS cell lines 
don’t show any obvious defects related to a 
disease after conversion into the relevant cell 
types. Gepstein’s team 


chose long QT syndrome DNATURE.COM 
because it stems from  Foravideoof the 
genetic mutations that _ heating cells, see: 
lead to an obvious change _go.nature.com/xfwdex 


Heart cells derived from induced pluripotent stem 
cells provide a useful disease model. 


in the heart cells. Other conditions successfully 
modelled from a patient's cells, such as spinal 
muscular atrophy and Fanconi anaemia, also 
come with obvious genetic causes and clear 
physiological readouts. “These are all low- 
hanging fruits,” says Mummery. 

Mike Venuti, president of biotech company 
iPierian in San Francisco, California, says that 
complex diseases could be modelled if the con- 
dition has a distinct effect on cells that is not 
present in healthy comparisons. For instance, 
insulin-secretion defects in patient-specific 
pancreatic B cells could bea proxy for diabetes. 
His firm has made iPS cells from people with 
Alzheimer’s disease, Parkinson’s disease and 
type 2 diabetes and converted them into vari- 
ous cell types for drug screening. He expects 
that drugs identified using this method will 
reach clinical trial for conditions such as spinal 
muscular atrophy in the next few years. 

For all the enthusiasm about disease models 
made from iPS cells, embryonic stem cells (ES 
cells) shouldn't be forgotten for drug screens, 
says Stephen Minger, at GE Healthcare in Car- 
diff, UK. His firm is developing drug-safety 
tests — including for heart arrhythmias — 
using ES cells. He adds that the genetic manip- 
ulations used to make iPS cells could harm 
them and cloud the search for disease traits. “I 
think the field has gone a little bit overboard on 
iPS cells. I think they have tremendous poten- 
tial, but ES cells do as well. = 
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ASTROPHYSICS 


Missing part delays space mission 


Schedule slips for European-led effort to blaze a trail for gravitational-wave detection. 


BY EUGENIE SAMUEL REICH 


on the LISA Pathfinder mission, the 

situation is excruciating. Nearly all the 
instruments for the €300-million (US$400- 
million) spacecraft have been delivered for what 
was originally to have been a launch this year. 
But delays have pushed that target to 2013 and 
possibly later, with everything now held up bya 
small but crucial component. “All the rest is wait- 
ing for one part. It’s heartbreaking?’ says Vitale, a 
physicist at the University of Trento in Italy. 

It is a rougher-than-anticipated start for a 
mission that was created to find obstacles. LISA 
Pathfinder is a European-led test of the tech- 
nology needed to run the Laser Interferometer 
Space Antenna (LISA), an ambitious effort to 
detect gravitational waves from sources in the 
distant Universe. Scientists hope that LISA 
can achieve this by measuring the precise 
separations between three pairs of masses 
free-floating inside three spacecraft positioned 
5 million kilometres apart. The technical chal- 
lenge along with the estimated cost of LISA 
(€1 billion to €2 billion) made a precursor mis- 
sion a necessity. If LISA Pathfinder encounters 
significant problems it could sow doubts about 
the overall effort. 

LISA Pathfinder is not expected to detect 
gravitational waves, but it must deploy and 
measure the relative positions of two test 
masses with sufficient precision for LISA to 
move forward. The missing piece of the mis- 
sion is part of a ‘caging mechanism consisting 
of two sets of eight fingers that will hold the two 
1.96-kilogram gold—platinum masses during 
launch, and then, once the spacecraft reaches 
its orbit at the L1 Lagrangian point where the 
gravitational pull of Earth and Sun are balanced, 
delicately release them. The masses will then 
float freely inside their separate compartments 
while the spacecraft uses electrical microthrust- 
ers to maintain its position so precisely that the 
masses do not hit the sides of their containers. 

The exacting requirement for a mecha- 
nism that can hold the masses firmly enough 
to withstand a force of 2,000 newtons but still 
release them without imparting a velocity of 
more than 5 micrometres per second (18 milli- 
metres an hour) lies at the heart of the delay. A 
first prototype of the motor powering the fin- 
gers failed key tests, prompting the European 
Space Agency (ESA) to set up a task force to 
look into the problem. The motor is now being 
redesigned from scratch. “Little by little, the 
launch date is slipping,’ says Pierre Binétruy 


| - Stefano Vitale, a principal investigator 


The LISA Pathfinder is missing the mechanism to hold two masses (yellow cubes) in place during launch. 


of Paris Diderot University, a physicist on the 
LISA international science team. 

Scientists on the mission say that the impor- 
tant thing is to learn from the delay, to avoid 
similar problems on LISA. With LISA Path- 
finder, ESA initially followed a conventional 
model for managing space missions, assign- 
ing science research groups outside the space 
agency to design the payload — including the 
caging mechanism — while industrial partners 
designed the spacecraft itself. But designers 
found that the spacecraft was operationally 
indistinguishable from its science payload, 
because the positioning of the masses inside 
it is coupled closely to the craft’s ability to keep 
its place in space using the microthrusters. 
ESA then took on the design of the caging 
mechanism together with a contractor, Thales 
Alenia Space in Milan, Italy, which was unable 
to comment before Nature went to print. 

On 10 February, the ESA Science Programme 
Committee is expected to assess options for the 
new design and chart a path forwards. 

Last August, the Astro2010 decadal survey 
of the US National Academy of Sciences ranked 
participation in LISA among its top priorities, 
above a competing project, the International 
X-ray Observatory (IXO). But that recommen- 
dation assumed a successful LISA Pathfinder. 
Xavier Barcons, a physicist at the Cantabria 
Institute of Physics in Santander, Spain, who 
works with IXO, says the problems on Path- 
finder call into question the decision to rank it 
higher than his project. “We also have technical 
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difficulties but we've mastered the basics. LISA 
is a completely new adventure,’ he says. 

He says that it is not clear whether LISA can 
fly by 2025, as the decadal survey assumed. 
But Fabio Favata, head of ESA’s science 
coordination office, says that by uncovering 
problems early, LISA Pathfinder could help 
LISA avoid delays. “The present situation, 
although unfortunate, does emphasize the 
importance of pathfinding,’ he says. m 


CORRECTIONS 

The timeline in the News story ‘Cancer trial 
errors revealed’ (Nature 469, 139-140; 
2011) stated that Harold Varmus asked 
the Institute of Medicine to review Duke 
University’s trials in June 2010. He made 
this request in July 2010. 

The News story ‘Tevatron faces final 
curtain’ (Nature 469, 141; 2011) states that 
the Mu2e experiment will study the decay 
of muons to electrons. In fact, it will look 
for evidence of neutrino-less conversion of 
muons to electrons. 

The story ‘Science fortunes of Balkan 
neighbours diverge’ (Nature 469, 142-143; 
2011) wrongly referred to the Romanian 
Academy of Sciences — it should have said 
the Romanian Academy. Also Bulgaria cut 
the budget of its Academy of Sciences by 
38% to 60 million leva (US$40 million), not 
75 million leva as stated. 


WN TH 
VTE RIGHWAY 


For scientists, collisions at the world’s most powerful particle collider are just the start. 
Nature follows the torrent of data on its circuitous journey around the world. 


ATLAS PARTICLE DETECTOR, SWITZERLAND, 
30 MARCH 2010, / 3:06 LOCAL TIME 


Beneath gently rolling hills between the 
mountains of Switzerland and France, the 
world’s greatest physics experiment starts its 
first real run. Two beams of high-energy pro- 
tons meet head-on at almost the speed of light 
inside the Large Hadron Collider (LHC), a 
giant particle accelerator at CERN, Europe's 
high-energy physics lab. Nanoseconds after the 
protons crash together, their combined energy 
gives birth to heavier particles, which decay in 
an instant into a splatter of lighter debris. 

At the collision point, 92 metres under- 
ground, the 7,000-tonne ATLAS detector 
sees everything. The debris particles pass 
first through the detector’s inner tracker — a 
sophisticated layer of silicon electronics that 
records their paths. Beyond that lie systems 
that measure the energies of the particles. Some 
drag to a stop there, but heavy cousins of elec- 
trons called muons barrel along, flying metres 
from the collision point before being picked up 
by giant, mustard-coloured sensors. 

Microprocessors convert the particles’ paths 
and energies into electronic signals, and select 
a handful of promising collisions for a closer 
look. The data from the chosen collisions zip 
upstairs to a computer farm that discards the 
majority and creates a digital reconstruction of 
those that remain. 

Even after rejecting 199,999 of every 200,000 
collisions, the detector churns out 19 gigabytes 
of data in the first minute. In total, ATLAS and 
the three other main detectors at the LHC pro- 
duced 13 petabytes (13 x 10’° bytes) of data in 
2010, which would fill a stack of CDs around 14 
kilometres high. That rate outstrips any other 
scientific effort going on today, even in data-rich 


BY GEOFF BRUMFIEL 


fields such as genomics and climate science (see 
Nature 455, 16-21; 2008). And the analyses are 
more complex too. Particle physicists must 
study millions of collisions at once to find the 
signals buried in them — information on dark 
matter, extra dimensions and new particles that 
could plug holes in current models of the Uni- 
verse. Their primary quarry is the Higgs boson, 
a particle thought to have a central role in deter- 
mining the mass of all other known particles. 

The architects of the LHC decided in 2001 
to deal with all that data by dividing and con- 
quering. The results from the giant particle 
detectors get parcelled up and sent to a vast 
global network known as the Worldwide 
LHC Computing Grid, the most sophisticated 
data-taking and analysis system ever built. The 
network is as great a technological leap as the 
collider itself, and without it the project would 
quickly drown in its own data. 

The Grid consists of some 200,000 processing 
cores and 150 petabytes of disk space, distrib- 
uted across 34 countries through leased data 
lines (see ‘March of the data’). By combining 
these resources, the Grid enables scientists to 
run vast analyses that would push the world’s 
most powerful supercomputers to the edge. 


CERN COMPUTING CENTRE, 30 MARCH 2010 


Within minutes, the first collisions have 
made their way to a 1970s-era concrete 
building on the other side of CERN’s cam- 
pus. In a white, high-ceilinged room, racks 
containing 50,000 com- 


puting cores undertake NATURE.COM 
a careful reconstruc-  Formoreonthe 
tion of every selected Large Hadron 
collision. Details of Collider, visit: 
each sub-detector’s  go.nature.com/Ixtne9 
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calibration, along with temperature read- 
ings and other environmental data from the 
cavern where ATLAS is housed, are used 
to piece each event back together. ATLAS 
scientists at CERN pull up reconstructions 
showing starbursts of narrow lines spreading 
from the collision points. 

In Grid terminology, the CERN comput- 
ing centre is known as Tier 0. It undertakes 
an initial analysis of the data and stores one 
copy. The physics data from ATLAS on the first 
day of the March run total about 5.2 terabytes 
(5.2 x 10’ bytes), enough to fill around ten 
laptop computers, or five of the digital stor- 
age tapes kept on the floor below the rows of 
processors. The first day’s harvest is modest 
compared with what will follow, but the ATLAS 
experiment has more than a thousand collabo- 
rators waiting for results. If all of them logged 
into CERN and attempted to pull the data from 
the first collisions back to their home institu- 
tions, the network would grind to a halt. 

So instead, the Grid automatically spreads 
copies of the data geographically. Inside a small 
partitioned section of the computing centre, a 
wall of panels bristles with bright-orange fibre- 
optic cable. This is the heart of the system, 
and it routes data to sites across the globe at a 
blistering rate of 5 gigabytes per second. 


OXFORDSHIRE, UK, 30 MARCH 2010 


After CERN finishes the initial analysis, a 
dedicated fibre-optic link carries some of 
the data from the first round of collisions 
more than 800 kilometres to the Rutherford 
Appleton Laboratory, a sprawling research 
park nestled among muddy fields in rural 
Oxfordshire. Here, in a modern office build- 
ing, a computing farm receives the data 


CERN 


through a yellow cable only slightly thicker 
than a phone line. The lab is one of 11 Tier 1 
centres spread around the world, where the 
data are further refined and split. 

Particle physics is a bit like investigating a 
mid-air collision. Nobody is there to witness 
it; instead, the debris is painstakingly collected 
and reassembled to give investigators hints 
as to what happened. In this case, physicists 
divide up the different kinds of particles for 
study. One group looks at muons, for example, 
while another focuses on high-energy y-rays. 
The computers at the lab help by creating doz- 
ens of copies of the data, focusing on various 
aspects of the collision. They are given names 
like datal0_7TeV.00152166.physics_MinBias. 
merge. DESD_PHOJET.* — which contains 
data on photons and narrow jets of particles. 


CHICAGO, ILLINOIS, 15 MAY 2010 


A team of US researchers sends a request for 
data out on the Grid, and information on several 
subsets of the collisions from 30 March travels 
from Oxfordshire via New York to a post-war 
University of Chicago building just two blocks 
from the site of the Manhattan Project’s first 
nuclear reactor. 

Rob Gardner, the physicist in charge of the 
computing facility, says, “What we've assem- 
bled here is a data centre just about as cheaply 
as we can put silicon on the floor” It looks like 
a smaller version of the computing centres in 
Geneva and Oxfordshire, but with one impor- 
tance difference: researchers can bring coffee 
into the Chicago site. “It’s not a clean environ- 
ment,” says Gardner. 

His cluster of computers is one of the Grid’s 
140 Tier 2 sites. Unlike Tier 1s, which undertake 


serious reconstructions of the data, Tier 2 
centres mainly provide storage and computing 
resources and can be accessed by users all over 
the world. 

In an office above the cluster, postdoc 
Antonio Boveia sits at a metal desk with his 
laptop. His machine is at the far end of the 
Grid from CERN, with lines of code scroll- 
ing against the black screen. To conduct an 
analysis — such as one on the decay of the 
Higgs boson into heavy particles known as 
W-bosons — he types in commands in the 
common programming language C++. For 
just one of Boveia’s analyses, he must study 
tens of millions of collisions. Even ifhis laptop’s 
hard drive were 4,000 times its current size 
and could accommodate the data, his proces- 
sor would still take a few years to complete the 
work. “Tt would be impossible,” he says. 

The Grid makes it possible by splitting the 
task. When Boveia enters his request, the Grid 
pulls data from sites such as the one in Oxford- 
shire, then parcels the analysis into thousands 
of separate pieces and spreads it across the 
network. The pieces might be processed at 
CERN, or at a facility in Italy, or, more likely, in 
many places at once. In a matter of days, Boveia 
receives an e-mail alert telling him that the 
analysis is complete. 

The operation does not always work so 
smoothly. The Tier 1 and Tier 2 centres are 
managed locally, which means that they each 
have their own protocols — and problems. In 
the summer of 2009, as simulated data was flow- 
ing through the Grid in advance of the first real 
collisions, fluff from local cottonwood trees 
clogged the Chicago centre's air-conditioning 
unit and forced a shutdown. The same year, 
road workers severed one of CERN’s fibre-optic 
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links in Switzerland, and a fire brought down 
the Tier 1 centre in Taipei, Taiwan, for months. 
When things go wrong, alerts are dispatched by 
e-mail or, occasionally, by phone to an assort- 
ment of emergency contacts around the globe. 

The system relies on goodwill, says Jamie Shi- 
ers, a group leader in CERN’s computing depart- 
ment. “We have no line management over these 
people whatsoever,’ he says. But somehow, the 
global cooperative produces results. 


CERN, 24 DECEMBER 2010, 77:54 


The ATLAS team posts an initial analysis from 
the Chicago group onto the pre-print server 
arXiv.org (ATLAS Collaboration. Preprint at 
http://arxiv.org/abs/1012.5382; 2010). The 
report — on W-bosons produced through 
mechanisms other than the decay of Higgs 
bosons — includes collisions from the first 
day’s run, along with many others. Measure- 
ments of the W-bosons produced show good 
agreement with existing theories. 

The physics data set from 30 March now 
makes up just 0.02% of the total data collected 
by the ATLAS detector. Most physicists on the 
collaboration are using that initial set with- 
out even realizing, as they acquire sections 
for analysis and combine them with other 
data sets. The first hints of a Higgs boson may 
already be stored on a computer disk in Mum- 
bai, Melbourne or one of the many other sites 
to which LHC data are distributed. But even 
if it is there, the Higgs will stay hidden until 
many more petabytes have flowed through the 
Grid. = 


Geoff Brumfiel is a senior reporter for Nature 
based in London. 


MARCH OF THE DATA 


The Worldwide LHC Computing Grid harnesses 200,000 computing cores in 34 countries. The central node of the Grid, called Tier 0, is housed at 
CERN in Geneva, Switzerland; there are 11 Tier 1 sites and 140 Tier 2 sites. This story follows data on a trip from CERN to Chicago, Illinois. 
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Steven Lockley demonstrates an experimental set-up for studying light-sensitive cells in the eye. 


SEEING WITHOUT 
SEEING 


There is more to the eye than rods and cones — the discovery 
of a third photoreceptor is rewriting the visual rulebook. 


ussell Foster remembers his first 

human subject, an 87-year-old 

woman, as she sat in a dark room fac- 

ing a backlit pane of frosted glass. A 
genetic disorder had destroyed the light-sensing 
rod and cone cells in her eyes, leaving her blind 
for the past 50 years. She was convinced that she 
would see nothing. But as the wavelength oflight 
in the room shifted to blue, she reported — after 
some hesitation — a sort of brightness. 

“That just blew us away,’ says Foster, a 
neuroscientist at the University of Oxford, UK, 
and one of the senior authors of a 2007 study 
reporting the finding’. 

Foster and his collaborators had done noth- 
ing to treat the woman’s blindness. Instead, 
her awareness of light owed itself to a class of 
light-sensitive cells discovered in 2002. Stud- 
ies of these intrinsically photosensitive retinal 


BY CORIE LOK 


ganglion cells (ipRGCs) have since revealed 
many surprises. Scientists initially thought that, 
rather than contribute to vision, the cells sim- 
ply synchronized the circadian clock, which sets 
the body’s 24-hour patterns of metabolism and 
behaviour, with changing light levels. However, 
recent work suggests that ipRGCs have been 
underestimated. They may also have a role in 
vision — distinguishing patterns or tracking 
overall brightness levels — and they seem to 
enable ambient light to influence cognitive 
processes such as learning and memory. 


RODS AND CONES DETHRONED 

During the past century, vision scientists 
focused mainly on rods and cones as the light 
sensors of the eye. It took Foster, an outsider 
coming from the circadian-biology commu- 
nity, to uncover some of the first evidence 
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for a third type of photoreceptor. In the early 
1990s, while at the University of Virginia in 
Charlottesville, his lab tested the circadian 
light responses in a mouse mutant with reti- 
nas that degenerate over time, and found that 
they were indistinguishable from the responses 
in mice with normal retinas. But light had no 
effect on the internal clocks of mice whose eyes 
had been removed’. 

Scepticism was strong. Foster recalls 
people walking out during a talk he gave. 
Critics of the research argued that the mutant 
mice probably retained some rods and cones 
that could be setting the clock. So in 1999, 
Foster, who had moved to Imperial College 
London, crossed transgenic mice that had no 
cone cells with mice that had degenerative rod 
cells, thus eradicating both cell types in the off- 
spring. As long as the mice had eyes, they still 
had normal circadian rhythms”. 

The next year, Ignacio Provencio, a former 
graduate student of Foster’s now at the Uni- 
versity of Virginia, Charlottesville, identified 
the light-sensitive molecule melanopsin in the 
mouse and primate ganglion layer’ — a net- 
work of retinal cells that was only thought to 
relay signals from rods and cones to the brain 
(see ‘Light in layers’). The presence of this 
‘photopigment’ suggested that some of these 
cells might also sense light and serve as a new 
class of photoreceptor. Researchers raced to 
isolate the cells and show that they could fire 
in response to light, without input from rods 
and cones. 

The race ended in a tie in 2002. Samer 
Hattar, a neuroscientist at Johns Hopkins 
University in Baltimore, Maryland, and his 
colleagues found that as many as 1% of the cells 
in the mouse ganglion layer express melanop- 
sin, which is most sensitive to blue light®. David 
Berson, a neuroscientist at Brown University in 
Providence, Rhode Island, and his lab showed 
that these cells, ipRGCs, detect light on their 
own and reach into the brain’s pacemaker, the 
suprachiasmatic nucleus’. The two papers 
helped to win over the sceptics, says Russell 
Van Gelder, a neuroscientist and ophthalmolo- 
gist at the University of Washington, Seattle. 
“Things really took off in 2002,” he says. 

Researchers began to develop mouse mod- 
els in which they could selectively block input 
from each of the three photoreceptor types in 
the eye, to probe their individual contribu- 
tions. But rather than distributing jobs neatly 
between cell types, the cells seem to swap roles 
under different conditions. 

It became clear that under low light condi- 
tions, rods can set the body’s clock, but some 
groups have suggested that under different 
conditions cones can as well. Perhaps more sur- 
prisingly, researchers have found that ipRGCs 
may contribute to visual perception. Hattar 
and others fluorescently labelled ipRGCs in 
mice to trace the projections of these cells to 
the brain. They found that ipRGCs reach into 
more brain regions than expected, including 


centres involved in visual processing: the dor- 
sal lateral geniculate nucleus (LGN) and the 
superior colliculus. Mice without functioning 
rods and cones, but with intact ipRGCs, could 
even discriminate patterns in a visual test*. 

This is puzzling. Melanopsin responds 
slowly — on the order of seconds — to changes 
in light, limiting its ability to signal changes in 
spatial information, says Robert Lucas, a neu- 
robiologist at the University of Manchester, 
UK. He and his group found that in mice that 
lacked the gene for melanopsin, and therefore 
had non-photoreceptive ipRGCs, almost half 
of the neurons in the LGN had defective light 
responses. The mice were unable to track back- 
ground light levels, especially in the daylight 
range, suggesting that ipRGCs could be encod- 
ing information about brightness’. 

Researchers now think that ipRGCs and 
rods compensate for each other and may col- 
lectively be allowing the eyes and brain to 
respond to light across a wide range of bright- 
ness levels. Why these different photoreceptors 
share the load in such specific ways is not clear. 
For example, the sensitivity of ipRGCs to blue 
light may make them better suited to detect the 
arrival of dawn and dusk. 


BEGINNING TO SEE THE LIGHT 
The ipRGCs might influence phenomena 
beyond vision and circadian rhythms. Many 
physiological responses have been linked to 
light, such as sleep, migraine pain and seasonal 
affective disorder, and these have recently been 
associated with ipRGC activity. “There's likely 
to be a whole array of physiology that, to some 
degree, is light sensitive,” says Provencio. 
Learning and memory may be improved 
under certain light conditions. Provencio and 
his colleagues presented data last year show- 
ing the effects of light on a mouse model for 
learned fear. Mice were conditioned to associ- 
ate a mild electric shock with a tone cue. Those 
that had learned fear in the presence of light 
froze for longer in response to the tone than 
those that had been conditioned in the dark. 
This effect did not appear in mice engineered 
to lack rods and cones, but did in melanopsin- 
knockout mice, suggesting that the rods and 
cones are driving this light-enhanced learning. 
Still, the researchers have not ruled out a role 
for ipRGCs. These cells route information from 
the eyes to the non-visual centres in the brain, 
including those involved in fear responses. 
Hattar has unpublished data to suggest 
that activating melanopsin with light at vari- 
ous points in the sleep-wake cycle of mice 
impairs learning and memory, even when the 
animals have normal circadian rhythms. This 
could mean that exposure to light at times 
when the body isn’t expecting it can be dis- 
ruptive. And for humans, who have a smaller 
percentage of ipRGCs than mice, experi- 
ments are beginning to show how the cells 
might contribute to physiology and behaviour. 
Steven Lockley, a neuroscientist at Brigham 


LIGHT IN LAYERS 
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Light passes through the ganglion layer and cells in the inner retina to the predominant 
photoreceptors in the eye — the rods and cones. These then send visual information back to 
ganglion cells, which transmit it to visual and non-visual centres of the brain. A subset of 
ganglion cells, called intrinsically photoreceptive retinal ganglion cells (ipRGCs), contain a 
photopigment, melanopsin, and can also encode and transmit information about light directly. 


Inner retina 


and Women’s Hospital in Boston, Massachu- 
setts, and his colleagues tested the reaction 
times of 16 healthy volunteers while they 
were exposed to either blue or green light 
for 6.5 hours. Those exposed to blue light 
had faster reaction times and fewer attention 
lapses when they were asked to report when 
they heard a sound”. 

Lockley says that these different strands of 
research might eventually help to engineer 
‘healthier’ light — using specific wavelengths, 
intensities or even patterns to activate brain 
pathways and improve mood, sleep or mental 
performance. “This research opens up a whole 
new field in terms of light applications, both for 
use therapeutically and for the general popula- 
tion,’ says Lockley. 

Light of certain frequencies can have ben- 
eficial effects, but may also be detrimental 
to health. Lockley has been working with a 
group of light engineers, neuroscientists and 
ophthalmologists, who call themselves the 
Blue Light Group. They met for the first time 
this summer to discuss, among other things, 
any safety issues surrounding blue light, 
including the idea that excessive exposure 
to it might contribute to a type of vision loss 
known as macular degeneration. Many light- 
emitting diodes, a leading technology for 
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energy-efficient lighting, are rich in blue light, 
points out Charles Hunt, a materials scientist at 
the University of California, Davis, who leads 
the group. Could their wider adoption lead to 
health problems for people? 

Humans have evolved to live under natural 
light, says Van Gelder. “Could we be doing 
some damage to our health by poisoning 
the world with wavelengths that we’re not 
evolved to live in?” he asks. Given the emer- 
gence of new kinds of lighting, Hunt says that 
it is important to find out. “We need answers 
quickly,” he says. m 


Corie Lok is Nature’s Research Highlights 
editor. 
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Blogs and tweets are ripping papers 
apart within days of publication, leaving 
researchers unsure how to react. 


claimed The Wall Street Journal head- 

line on 1 July last year. “Who will live to 
be 100? Genetic test might tell,” said National 
Public Radio a day later. 

These and hundreds of similarly enthusias- 
tic headlines were touting a paper in Science’ in 
which researchers claimed to have identified a 
set of genes that could predict human longevity 
with 77% accuracy — a finding with poten- 
tially huge implications for medicine, health 
policy and the economy. 

But even as the popular media was trumpet- 
ing the finding, other researchers were taking 
to the web to criticize the paper's methodology. 
“We expect that most of the results of this study 
will not have the same longevity as its partici- 
pants,” sniped a blog posted by researchers at 
the personal genomics company 23andMe, 
based in Mountain View, California. 

Critics were particularly perturbed by the 
genome-wide association study (GWAS) that 
the authors had used to identify their longev- 
ity genes: the centenarians and the controls in 
the study had been tested with different kinds 
of DNA chips, which potentially skewed the 
results. 

“Basically anybody that does alot of GWAS 
knows this [pitfall], which is why we all said it 
so fast, says David Goldstein, director of Duke 
University’s Center for Human Genome Vari- 
ation, who voiced his concerns to a Newsweek 


C C S cientists discover keys to long life,’ pro- 


TRIAL BY 
TWITTER 


BY APOORVA MANDAVILLI 


blogger the day the study appeared. 

This critical onslaught was striking — but 
not exceptional. Papers are increasingly being 
taken apart in blogs, on Twitter and on other 
social media within hours rather than years, 
and in public, rather than at small conferences 
or in private conversation. In December, for 
example, many scientists blogged immedi- 
ate criticisms of another widely publicized 
paper’ — this one heralding bacteria that 
the authors claimed use arsenic rather than 
phosphorus in their DNA backbone. 


ACHORUS OF DISAPPROVAL 
To many researchers, such rapid response is all 
to the good, because it weeds out sloppy work 
faster. “When some of these things sit around 
in the scientific literature for a long time, they 
can do damage: they can influence what people 
work on, they can influence whole fields,’ says 
Goldstein. This was avoided in the case of the 
longevity-gene paper, he says. One week after 
its publication, the authors released a statement 
saying, in part, “We have been made aware that 
there is a technical error in the lab test used ... 
[and] are now closely re-examining the analysis.’ 
Then in November, Science issued an “Expres- 
sion of Concern’ about the paper’, in essence 
questioning the validity ofits results. 

When asked for a comment by Nature, the 
lead investigator on the paper, Paola Sebas- 
tiani, a biostatistician at Boston University 
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in Massachusetts, said only that she and her 
co-authors “feel it is premature for us to talk 
about our experience because this is still an 
ongoing issue”. 

For many researchers, the pace and tone 
of this online review can be intimidating — 
and can sometimes feel like an attack. How 
are authors supposed to respond to critiques 
coming from all directions? Should they even 
respond at all? Or should they confine their 
replies to the conventional, more delibera- 
tive realm of conferences and journals? “The 
speed of communication is ahead of the 
sheer time needed to think and get in the lab 
and work,” said Felisa Wolfe-Simon, a post- 
doctoral fellow at the NASA Astrobiology 
Institute in Mountain View, California, and 
the lead author on the arsenic paper. Aptly 
enough, she circulated that comment as a 
tweet on Twitter, which is used by many sci- 
entists to call attention to longer articles and 
blog posts. 

To bring some order to this chaos, it looks 
as though a new set of cultural norms will be 
needed, along with an online infrastructure to 
support them. The idea of open, online peer 

review is hardly new. 


> NATURE.COM Since Internet usage 
To join the debate began to swell in the 
about online review, 1990s, enthusiasts have 
go to: been arguing that online 
go.nature.com/b49ej5 commenting could and 


should replace the traditional process of 
pre-publication peer review that journals 
carry out to decide whether a paper is worth 
publishing. 

“It makes much more sense in fact to 
publish everything and filter after the fact,” 
says Cameron Neylon, a senior scientist at the 
Science & Technology Facilities Council, a UK 
funding body. 


FAST FEEDBACK 

In some fields, notably mathematics and 
physics, this sort of public discourse on a paper 
has long been the norm, both before and after 
publication. Most researchers in those fields 
have been depositing their draft papers in 
the preprint server arXiv.org for two decades. 
And when blogging became popular around 
the turn of the millennium, they were quick to 
start debating their research in that form. 

Scientists in other fields seem less willing 
to get involved in pre-publication discus- 
sion. Biologists, in particular, are notoriously 
reluctant to publicly discuss their own work or 
comment on the work of others for fear of 
being scooped by competitors or of offending 
future reviewers of their own work. Adding 
to the disincentive is the knowledge that ten- 
ure committees and funding agencies do not 
explicitly reward online activity. 

As a result, several journals — including, in 
2005, Nature — have tried and mostly failed 
to interest scientists in various forms of open 
review. “Most papers sit in a wasteland of 
silence, attracting no attention whatsoever,’ 
says Phil Davis, a communications researcher 
at Cornell University in Ithaca, New York, and 
executive editor of The Scholarly Kitchen, a 
blog run by the Society for Scholarly Publishing 
in Wheat Ridge, Colorado. 

Journals have had a little more success with 
post-publication peer review in the form 
of comments to the online versions of their 
papers. But the discussion is hardly 
vigorous, largely because the jour- 
nals have usually solicited these 
post-publication critiques on 
their own websites, rather than 
on popular social networking 
sites. 

“Who in their right mind 
is going to log on to the PLoS 
One site solely to comment 
on a paper?” asks Jonathan 
Eisen, academic editor-in- 
chief of PLoS Biology, and a 
prolific blogger and tweeter. “I 
guarantee that there are more 
comments on Twitter about a PLoS 
paper” 

The question for researchers is how to 
deal with this ad-hoc analysis of papers. 
Unstructured, unruly and often anonymous, 
online commenting can be exasperating for 
biologists used to more conventional means 
of discussion. Like Sebastiani, for example, 


Wolfe-Simon initially tried to stay out of 
the brouhaha over the arsenic paper. “Any 
discourse will have to be peer reviewed in 
the same manner as our paper was, and go 
through a vetting process so that all discus- 
sion is properly moderated,’ she said when 
the controversy first erupted. She and a co- 
author later did provide answers to a few of the 
criticisms on her website. 

But Goldstein, who has also had publica- 
tions on the receiving end of negative online 
reviews, tries to take the process in his stride. 
“T think if the work is solid, it holds up over 
time and this chatter is not going to hurt solid 
work,’ he says. Nonetheless, he adds, “there can 
be a herd mentality to this, which one wants 
to be really careful of” — especially for exam- 
ples such as the longevity and arsenic papers, 
for which neither the rapid spike in fame nor 
the equally sharp fall into disrepute may be 
fully justified. 

One solution may lie in new ways of 
capturing, organizing and measuring all these 
scattered inputs, so that they end up making a 
coherent contribution to science instead of just 
fading back into the blogosphere. Perhaps the 
most successful and interesting experiments of 
this type can be found at websites such as Fac- 
ulty of 1000 (F1000) and thirdreviewer.com, 
and in online reference libraries such as 
Mendeley, CiteULike and Zotero, which allow 
users to bookmark and share links to online 
papers or other interesting sites. 

F1000, which was launched in 2002 and 
evaluates papers from journals across biology, 
is among the best known of these websites. It 
now relies on a ‘faculty of more than 10,000 
peer-nominated researchers and clinicians 
who select, evaluate and rate papers with a 
score of 6 (‘recommended’), 8 (‘must read’) or 
10 (‘exceptional’). The individual scores are 
then combined using a formula to generate the 
paper’s F1000 article factor. These scores, in 
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turn, are making some 
appearances in tenure 
packages and grant 
applications. “It’s the 
only one we've been using ¥¢ 
in any systematic way,’ says Liz _ 
Allen, who leads post-award evalua- 
tion at the Wellcome Trust in London. “Tt adds 
another dimension to the citation index.” 
However, critics note that F1000 rankings 
tend to correlate closely with traditional cita- 
tions, which suggest that they add little, if any, 
extra value. And most papers never attract the 
attention of the faculty members, so that they 
are never ranked at all. Even one as talked- 
about as the longevity paper garnered only a 
single rating on F1000: a must-read score of 8. 
For comparison, the currently highest-ranked 
paper on the site has an aggregate score of 62, 
and scores of 20 or more are common. 
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META-TWITTER 

Given the vagaries of such measures, there is a 
growing interest in methods that would aggre- 
gate and quantify all of the online responses 
and evaluations of a paper — producing what 
Neylon and some others are referring to as 
‘alt-metrics’ — and compare it with more 
conventional metrics. 

“As scholars migrate to newer forms of com- 
munication, it becomes very important to 
measure what they’re doing and to compare,” 
says Jason Priem, a second-year graduate 
student in information science at the Univer- 
sity of North Carolina in Chapel Hill, who is 
focusing his study on alt-metrics. 

Neylon is leading a £30,000 (US$50,000) 
grant proposal to create and test a working 
alt-metrics prototype that would rapidly meas- 
ure a paper's impact by assessing all the activ- 
ity surrounding it online. In addition, he and 
many of his colleagues champion a completely 
online system of pre-publication peer review 
that would build on the arXiv.org model, and 
would replace what they see as a flawed process 
with a more egalitarian and transparent one. 

That last step, however, may be a bit farther 
than most scientists are willing to go — even 
the ones who energetically blog and tweet their 
post-publication reviews. Although the latter 
activity is “a nice secondary mechanism for 
catching things”, says Goldstein, “I think we 
do not want it to be just a commentary free- 
for-all as the only arbiter of quality.” 

“Tt’s exactly like what’s said about democ- 
racy, he adds. “The peer-review process isn't 
very good — but there really isn’t anything 
that’s better.” m 


Apoorva Mandavilli is a writer based in 
New York. 
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Get ready for the flood of 
fetal gene screening 


Regulators, doctors and patients need to prepare for the ethical, legal and practical 
effects of sequencing fetal genomes from mothers’ blood, says Henry T. Greely. 


he world’s news media was buzzing 
Tis week after researchers showed 

that a blood test for mothers could 
detect Dowr’s syndrome in their fetuses’. 
Last month, two research groups independ- 
ently published proof that the fetal genotype 
— the genetic status at a given locus — can be 
derived for thousands of sites from samples 
of fetal DNA with just a 10-millilitre blood 
draw from a pregnant woman”. 

The brave new world of widespread 
prenatal genetic diagnosis has been always 
‘arriving’ since Nature published a paper by 
Danish researchers Fritz Fuchs and Povl Riis 
in 1956, reporting the first prenatal genetic 


testing in humans’. With non-invasive 
prenatal genetic diagnosis (NIPD) it may 
finally have arrived. Checking for hundreds 
or thousands of traits with one blood test, 
early in pregnancy, could move prenatal 
genetic testing from uncommon to routine. 

That possibility challenges all societies to 
decide for which ends and by what means 
they want such tests to be used, raising hard 
questions about, among other things, abor- 
tion, disability rights, eugenics and informed 
consent. 

Prenatal genetic testing has been clinically 
available since the late 1960s, but the costs, 
inconvenience and especially the miscarriage 


risks have limited its use. Each year, less than 
2% of pregnant women in the United States 
undergo amniocentesis (in which a small 
amount of amniotic fluid containing fetal 
cells is taken for analysis) or chorionic villus 
sampling (CVS — in which fetal tissue is 
extracted from the placenta). Both proce- 
dures increase the risk of miscarriage. Until 
now, any given sample could be tested for 
only one or two conditions, typically chromo- 
somal abnormalities such as trisomy 21, 
the cause of Down's syndrome. 

These factors combined to limit recom- 
mended use of these methods to women 
with a higher risk of having a fetus with > 
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> aparticular disease. Most frequently this 
has meant women over the age of 35, whose 
chances of carrying a fetus with Down’s 
syndrome are greater than the risk of a 
miscarriage caused by the procedure. 

Biologists have known for decades that 
some fetal cells pass through the placenta 
and into the mother’s blood stream. Technical 
problems have hampered attempts to isolate 
individual fetal cells and, even when such 
cells could be found, there was no guarantee 
that they were from the present pregnancy. 
Analysing the free-floating fragments of fetal 
DNA that exist in a pregnant woman's blood 
serum is proving more successful. 

Blood contains billions of DNA fragments 
released when cells die and are broken up by 
enzymes. Even early in pregnancy, 5-10% 
of that ‘cell-free’ DNA in pregnant women 
comes from the fetus. Thanks to cheap and 
sensitive sequencing techniques, this DNA 
can be examined and aspects of the fetus’s 
genome analysed. For some traits, such as 
paternally inherited dominant conditions, 
this can be done by looking for DNA variants 
— alleles — that the mother does not carry. 
For other traits, the number of copies of each 
variant can be used to determine how many 
copies of a chromosome the fetus carries, as 
well as how many copies of which alleles. 


IN THE CLINIC 
Non-invasive prenatal genetic diagnosis 
is already in clinical use for fetal blood- 
type screening. A woman of blood type 
Rhesus (Rh) negative can create antibodies 
against the red blood cells of a fetus of type 
Rh positive, injuring that fetus, or subse- 
quent fetuses. In many countries, pregnant 
Rh-negative women routinely receive pro- 
tective antibodies. Now, in several countries, 
including the United 


Kingdom, the Neth- “Commercial 
erlands and France, development 
cell-free DNA analysis of these 

is being used to deter- methods 

mine the Rhtypeofthe seems likely 
fetusand the antibody within five 

is only injected when years.” 
needed. 


The potential of NIPD goes way beyond 
Rhesus screening. Two of the leading 
researchers in cell-free fetal DNA testing — 
Dennis Lo of the University of Hong Kong 
and Steve Quake of Stanford University 
in California — use different methods to 
analyse fetal cell-free DNA from maternal 
serum. Each has demonstrated the ability 
to detect aneuploidies — missing or extra 
chromosomes, such as in trisomy 21 (refs 
5, 6). Last month, both researchers pub- 
lished proof that the fetal genotype could be 
derived for thousands of sites from cell-free 
fetal DNA*’ — demonstrating the possibility 
of using maternal blood to test for all fetal 
genetic traits. 


There seems to be no technical barrier, 
given increasingly cheap genotyping and 
sequencing, to being able to test one sample 
simultaneously for chromosomal abnormali- 
ties; for single-gene diseases, such as cystic 
fibrosis, sickle-cell anaemia, and Tay-Sachs 
disease; and for various non-disease genetic 
traits such as sex. 

Commercial firms are already interested. 
Sequenom in San Diego, California, is work- 
ing with Lo; another, Artemis Health of Menlo 
Park, California, is working with Quake; and 
still others are also exploring the technol- 
ogy. For-profit development of these meth- 
ods seems likely within five years, at least for 
chromosomal abnormalities, such as trisomy 
21, and possibly for single-gene traits. 

The scope and consequences of such test- 
ing will, of course, depend in large part on 
its accuracy. If NIPD is so inaccurate that it 
requires amniocentesis or CVS for confir- 
mation, its influence will be limited. But the 
improving power, and decreasing cost, of 
DNA sequencing make it likely that the accu- 
racy of these tests would be high. If necessary, 
samples can be genotyped or sequenced 
to greater and greater depth, particularly 
as costs drop, and additional samples, if 
needed, are just a blood draw away. 

When such testing does take off — and itis 
when, not if — the public controversy will be 
about its uses. In vitro fertilization provides 
one precedent. More than 30 years from its 
first use, debates continue about whether it 
can be used by unmarried people, homosexu- 
als and elderly women — and about who will 
pay for it. Preimplantation genetic diagnosis 
isa closer example, with strong disagreements 
about its use for sex selection, trait selection 
and the creation of ‘saviour siblings. With 
NIPD, abortion opponents will want little or 
no use of tests that will increase the number 
of pregnancies terminated. Some people will 
be concerned about technologies that prevent 
the birth of people with particular disabilities, 
both for the message that might send about 
the worth of those who are disabled and for 
its practical effects on research, treatment and 
support for those with disabilities. 

And the spectre of eugenics will loom 
over the whole discussion. Some will oppose 
parental choices about the characteristics of 
their babies; others will worry that parental 
choice will be influenced, or trumped, by the 
decisions of governments, health-care sys- 
tems or other institutions. Fears of eugenics 
will increase as such testing moves from fatal 
diseases to less serious medical conditions 
and then on to non-medical characteristics 
— sex selection today; skin, hair and eye 
colour tomorrow; perhaps, eventually, traits 
such as some cognitive or physical abilities. 
Still other kinds of uses will pose problems. 
Sometimes, for instance, parents with par- 
ticular conditions, such as genetic forms 
of deafness, may want to ensure that their 
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children have the same condition. Or some 
women, or the men in their lives, may want 
to move paternity testing in utero. 

Some of these concerns exist today — witness 
for instance the dramatic skewing of live- 
birth sex ratios in China and India brought 
about by cheap and accessible ultrasound. But 
they will only become more immediate and 
more important with widespread NIPD. 

Beyond these big questions lurk crucial 
operational details. Some involve the test- 
ing itself. Will such tests be regulated to 
ensure that they are safe and effective and, 
if so, how? Will the testing laboratories be 
subject to oversight that guarantees they 
perform the tests accurately? Who will 
pay for millions of genetic tests, and for 
the abortions that follow? The burgeoning 
controversies over the regulation of genetic 
testing, whether or not they are ‘direct to 
consumer, provide one very contemporary 
example of these questions; the regulation 
of, and payment for, IVF and preimplanta- 
tion genetic diagnosis provides another. 

Much of the social impact — and the impact 
on the medical system — will depend on how 
widely such testing is used. Some of that will 
depend on those who fund health care and 
whether they see this testing as yet another 
cost or as a way to save money by avoiding the 
births ofhigh-cost children. Part of the impact 
will depend on the legal system. Ifa test is clin- 
ically available and a physician does not offer 
it to a patient, at least in the United States, a 
physician could be liable through a ‘wrong- 
ful birth’ suit for the health costs of a child 
whose birth might have been prevented. 


TIME TO TALK 

In California currently, about two-thirds 
of pregnant women opt for non-invasive 
screening for Down’s syndrome and neural 
tube defects. Ifthe same fraction of pregnant 
women opt for NIPD, the United States alone 
would move from conducting fewer than 
100,000 fetal genetic tests a year to about 
3 million. Where will we find, or create, 
the professionals to provide genetic advice 
to these patients? And, of course, even if 
widely adopted, use of NIPD is unlikely to 
be uniform. It seems likely to vary between 
countries but also within countries, based 
on religion, ethnicity, education and other 
characteristics. In California, for example, 
it is thought that women with more educa- 
tion are more likely to accept screening and 
Hispanic women are less likely. What social 
issues will such disparities raise? 

For parents who do choose NIPD, we will 
need to make sure they truly choose it. Today, 
amniocentesis and CVS are invasive proce- 
dures, typically prepared for over time. The 
parents and their physician decide that their 
fetus is at high risk of having a genetic dis- 
ease, they go through genetic counselling and 
informed consent, and an invasive procedure 


is scheduled for several days later. Confronted 
with a long needle or a transvaginal probe, 
few, ifany, women will undergo either proce- 
dure without understanding that something 
serious is happening. 

But if NIPD requires just one more tube 
of blood from the mother — and just one 
more signature on one more form — how 
can we ensure that parents understand what 

they are consenting 


“How can we to? Already some 
ensure that who get results of 
parents blood-based screen- 
understand ing tests for the risk 
what they are of Down's syndrome 
consenting to?” are shocked to learn 


they ever agreed to 
the test. NIPD greatly increases what is at 
stake; parents must not be surprised when 
genetic-test results arrive. And, of course, 
that consent will be even more complicated 
when hundreds of genetic traits can be 
tested, not just one or two. 

These questions, and many others, have to 
be answered, and soon. Some of the answers 
may be the same across different cultures, 
others need careful national attention. Views 
and practices differ from country to country 
on abortion, on freedom of parental choice, 
on funding health care and on many other 
relevant considerations. 


A few European groups have been studying 
NIPD. A European Union consortium called 
SAFE — the Special Non-invasive Advances 
in Fetal and Neonatal Evaluation Network 
— studied the scientific, medical and ethical 
issues around more limited NIPD applica- 
tions for several years’; the PHG Foundation 
— the UK Foundation for Genomics and 
Population Health — convened a UK expert 
group that produced a report® on NIPD; and 
amore recent British project called RAPID 
— Reliable Accurate Prenatal non-Invasive 
Diagnosis — continues to work on the issues’. 
Little has been done in the United States'” and 
almost nothing elsewhere. 

Professional organizations, in medicine 
and in genetics, need to get involved, both 
in training their members about these tech- 
nologies and in beginning to consider guide- 
lines for their use, especially with regard to 
informed consent. Regulators, companies and 
consumer advocates need to be talking about 
pathways for assuring the safety, efficacy 
and quality of NIPD testing. In the United 
States, the Food and Drug Administration 
should start that process immediately. And 
it is time for ethics commissions, such as the 
US Presidential Commission for the Study of 
Bioethical Issues, to report on these issues. 

Most importantly, we need to start conver- 
sations, between all those concerned, about 


the limits, if any, to place on this powerful 
technology. Whether we view NIPD gladly 
as a way to reduce human suffering, war- 
ily as a step towards a eugenic dystopia, or 
as a mix of both, we should agree that the 
better we prepare, the more likely we are to 
avoid the worst misuses of this potentially 
transformative technology. = 


Henry T. Greely is at the Center for Law 
and the Biosciences, Stanford Law School, 
Stanford, California 94305, USA. 
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A lesson in sharing 


Earth scientists need better incentives, rewards and mechanisms 
to achieve free and open data exchange, says David Carlson. 


hen the polar-research commu- 
nity planned the International 
Polar Year (IPY) of 2007-08, it 


embraced a revolutionary goal: to establish 
free, open and ready access to all data. After 
decades of reports with ‘data and combina- 
tions of ‘integrated; ‘interoperable’ and ‘dis- 
tributed’ in their titles, the IPY presented an 
ideal test case — interdisciplinary but limited 
in duration and regional in focus. Yet the com- 
munity found inadequate services, almost no 
international support and few solutions. 

We have come out of the IPY with a rich 
burst of data, but the information uses the 
jargon and units of specialities from anthro- 
pology to astronomy, referenced to every- 
thing from Cartesian coordinates to postal 
codes. And despite the best efforts of the IPY 
Data and Information Service (www.ipydis. 
org), we cannot say how users might discover 
or access IPY data five years hence. Indeed, 
it emerged just last week that an upcoming 
report from the US National Academy of 
Sciences in Washington DC identifies the 
lack of data sharing as a barrier to under- 
standing rapid changes in polar ecosystems 
(see Nature 469, 145; 2011). 

What caused these failures? Technical 
impediments exist relating to formats, per- 
missions, bandwidth and so on, but the real 
problem is behaviour. The Earth sciences, 
like the science community as a whole, lack 
incentives for widespread data exchange. 

Long before the film Avatar popularized 
it, learned from engineering colleagues the 
whimsical but useful term ‘unobtainium’ — 
used to describe something perfect but elu- 
sive. A perfect data-sharing system is science’s 
unobtainium. We must respond creatively to 
the challenge. Steps begun as part of the IPY 
by the Earth-science community include 
establishing a polar information commons 
and instigating a journal for data publication 
and citation. 


ALEGACY OF CONFUSION 

The challenges of preserving and sharing IPY 
data have come up repeatedly. Meteorological 
data from the first IPY in 1882-83 emerged in 
digital, accessible form only during the plan- 
ning of the latest IPY. Data from the 1932-33 
IPY were scattered: some were rediscovered 
only in recent years at the Danish Meteorolog- 
ical Institute in Copenhagen. By the time of 
the 1957-58 International Geophysical Year, 
the International Council for Science (ICSU) 
was forming World Data Centres (WDCs) 


to help solve the problem. There are now 
more than 50 WDCs, all of which pledged to 
support the latest IPY. Most struggled. Few 
received increased funding to respond to new 
or bigger IPY data streams, and the system 
had no mechanisms for handling the ecologi- 
cal or social threads of the IPY programme. 
The current WDCs, which have been sup- 
plemented by national and speciality data 
centres, cannot meet the needs of modern 
international interdisciplinary science. 

The ICSU is establishing a World Data Sys- 
tem to reform and reinvigorate the WDCs, 
and the World Meteorological Organization 
is upgrading its global information system. I 
endorse these efforts. But without fundamen- 
tal changes to the incentives for data sharing, 
scientists will only perpetuate bad habits. 

Data centres depend on willingness to 
share. All IPY projects opted in to an explicit 
free and open data-sharing policy (go.nature. 
com/byf9b4). But many researchers do not 
recognize, much less comply with, this policy, 
and few national funding organizations have 
the motivation or means to enforce it. Many 
researchers worry about others ‘stealing’ or 
misusing their data, and so hoard them. 

To circumvent these attitudes, a team 
including myself developed the concept of 
a Polar Information Commons (PIC; www. 
polarcommons.org) data label in 2009. PIC 
data can be freely accessed and used accord- 
ing to voluntary rules on attribution, citation 
and recognition, version control and notifi- 
cation, and appropriate 


use. The ideahasbeen NATURE.COM 
favourably received in For more on data- 
the scientific commu- sharing mechanisms 
nity, anda pioneering and practice, visit: 
group of polar data — go.ature.com/kpfnlz 


COMMENT 


centres in Australia, Canada, Japan, Norway, 
Britain and the United States have indicated 
their support. However, when it comes to the 
nitty-gritty of making data fully available, the 
PIC often stalls in institutional or national 
legal departments. The collection of PIC- 

labelled data is growing — but slowly. 
Another effort is the Earth System Science 
Data (ESSD) journal, which I started with 
Hans Pfeiffen- 


“Aperfect data- _ berger, head of IT 
sharing system infrastructure at 
is science’s the Alfred Wege- 


ner Institute for 
Polar and Marine 
Research in Bremerhaven, Germany. It pub- 
lishes complex and comprehensive data sets, 
giving data providers credit just as for a tradi- 
tional publication. The journal uses a Creative 
Commons copyright policy to encourage free 
use of articles as long as the original authors 
and citation details are identified, and insists 
that authors deposit their data in a well- 
known open-access repository of their choice. 
Itis asmall effort so far — ESSD has published 
28 data sets since its first issue in 2009, and 
remains unique in Earth sciences. We hope it 
will inspire similar projects. 

The IPY crystallized our view of the 
unobtainable ideal, but hinted at solutions. 
The grand vision of the IPY ran aground on 
practicality and pragmatism, so we must take 
practical steps to change behaviours. = 


‘unobtainium’.” 
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Izabella Godlewska de Aranda’s painting Cosmic Joy! (2009) hints at the idea of many universes. 


COSMOLOGY 


The untestable 
multiverse 


George Ellis reminds us that Brian Greene’s beguiling 
book on parallel worlds is more theory than fact. 


osmology must seem odd to scientists 
( in other fields. More and more accu- 
rate data about the distant Universe 
are being generated by high-tech observa- 
tional techniques, giving rise to an era of 
‘precision cosmology — but to explain these 
impressive data, cosmologists are increas- 
ingly turning to untestable theories. 
In The Hidden Reality, theoretical physicist 
Brian Greene explores one of the strangest 
proposals: that we live in a multiverse. 


This fashionable concept supposes that 
large, perhaps infinite, numbers of separate 
universes exist in parallel to ours. Using 
well-constructed analogies, Greene explains 
nine different multiverse proposals, ranging 
from simple extrapolations of cosmologi- 
cal models to those based on quantum field 
theory, string theory and pure logic. 
Greene carefully sets out the reasoning 
behind each proposal and the grounds for why 
it might be true. His arguments are absorbing 
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and convincing on 
their own terms. But 
they inexorably lead us 
farther from the gold 
standard of testability 
that is the scientific 
method. The beguil- 
ing text moves beyond 
established science into 
philosophical specula- 
tion. 

Greene's nine types 
of multiverse are as 


The Hidden 


Reality: Parallel 
Universes and the 


Deep Laws of the 
follows. First, if space Cosmos 
extends forever, an BRIAN GREENE 
infinite number ‘nopt/Allen Lane: 

F «2011. 384 pp. 
of domains simi $29.95/£25 


lar to ours might lie 
beyond the part of 
the Universe that we can see. Second, some 
versions of inflationary theory — the idea 
that the newborn Universe had a fleeting 
period of super-fast accelerating expansion 
— predict the existence of innumerable 
other universes, with different character- 
istics from our own. Third, string theory, 
the pre-eminent theory of quantum gravity, 
suggests that our Universe might be one of 
many four-dimensional ‘braneworlds float- 
ing in a higher-dimensional space-time. 

This option is developed further in the 
fourth and fifth proposals, which involve 
cyclic universes, or variations on physical 
parameters that are possible in the string- 
theory landscape. The sixth is a quantum 
mechanics idea that many worlds simultane- 
ously exist as branches of the wave function 
of the Universe. The seventh suggests that 
the Universe is a holographic projection. The 
eighth states that we live in one of a set of arti- 
ficial universes created as simulations on a 
super-advanced computer. The ninth argues 
that it is a philosophical necessity that every 
possible universe must be realized some- 
where, in “the grandest of all multiverses” 

By presenting this plethora of theories, 
Greene gives the impression that the multi- 
verse is on a sound scientific footing, but 
these nine arguments are mutually exclusive. 
We do not know how to test which is right, if 
any, because we cannot make direct observa- 
tions of domains beyond the observational 
horizon — the greatest distance that light can 
have travelled towards us since the Universe 
became transparent to radiation 300,000 years 
after the Big Bang. Given this lack of evidence, 
there is a viable tenth option: that there is no 
multiverse at all. 

Greene cites indirect evidence to support 
the multiverse idea. The values of the physi- 
cal parameters seem to be fine-tuned to allow 

life. For example, if the 
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of the Universe — was much different, then 
galaxies would not exist and we would not be 
here to make measurements. Similarly, the 
strength of the strong nuclear force permits 
atoms, and hence humans, to exist. Such 
anthropic reasoning invokes multiverses 
where there is some likelihood that physical 
constants take on different values in each. 

But probabilistic arguments only make 
sense if these parallel universes actually exist. 
And logic cannot prove their existence. For 
instance, a multiverse model may predict a 
likely value of the cosmological constant, but 
the reverse is not true. A particular measure- 
ment of the cosmological constant does not 
require a multiverse. Nor can the multiverse 
concept be disproved by any specific observa- 
tionally determined value of the cosmological 
constant, for multiverses can accommodate 
any value. These arguments can only provide 
probabilistic consistency tests for some kinds 
of multiverse. 

So one can motivate multiverse hypotheses 
as plausible, but they are not observationally 
or experimentally testable — and never will 
be. It is easy to support your favourite model 
over others because no one can prove you 
wrong — you can simply adjust its param- 
eters to fit the latest information. If the 
Universe is a simulation (option eight), then 
anything is possible. However, the existence 
of a computer allowing such a simulation is 
not remotely feasible. Scientists are beginning 
to confuse science with science fiction. 

Greene, to his credit, devotes a chapter to 
the question of whether the multiverse idea 
is a scientific theory or not. He believes it is, 
and even supports the extravagant claim that 
infinities exist — infinite numbers of universes 
hosting countless galaxies. This leads to well- 
known paradoxes, such as the infinite repeti- 
tion of everything because of the finiteness of 
possibilities. But again, there is no way to test it, 
because infinity is always beyond reach — and 
so will not plausibly exist in physical reality, as 
mathematician David Hilbert argued. 

The gap in current theories that warrants 
pursuing such untestable theories is our 
inability to predict firmly why physical 
constants have the values they do. If a fun- 
damental theory were to be proposed that 
explained them, the drive for a multiverse 
explanation would fall away. But the puzzle 
of why these values allow life would remain. 

The multiverse argument is a well-founded 
philosophical proposal but, as it cannot be 
tested, it does not belong fully in the scientific 
fold. Read The Hidden Reality with enjoyment, 
but beware its misleading title. Greene is not 
presenting aspects of a known reality; he is 
telling of unproven theoretical possibilities. m 


George Ellis is professor emeritus of applied 
mathematics at the University of Cape 
Town, Rondebosch 7701, South Africa. 
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Books in brief 


The 4% Universe: Dark Matter, Dark Energy, and the Race to 
Discover the Rest of Reality 

Richard Panek HOUGHTON MIFFLIN HARCOURT 320 pp. $26 (2011) 
The more cosmologists have learned in recent decades, the less 
they seem to know about the Universe. Just 4% of it is composed 

of normal matter, made of protons and neutrons. The rest is ‘dark 
matter’ of unknown origin, and ‘dark energy’, a force that pulls 
space apart. Science writer Richard Panek engagingly tells the story 
of the discovery of these cosmic ingredients through interviews with 
the astronomers who unearthed them, charting the often-bitter 
rivalries between competing researchers. 


In the Light of Evolution: Essays from the Laboratory and Field 
Edited by Jonathan B. Losos ROBERTS AND COMPANY PUBLISHERS 
336 pp. $49.95 (2011) 

Leading experts in evolution report from the field in this collection 
of plainspoken essays, providing a valuable resource for non- 
specialists wanting to improve their understanding of this vital 
topic. Historian Janet Browne writes on Charles Darwin; writer 
Carl Zimmer muses on microbes; Daniel Lieberman discusses 
the evolution of human bipedalism; Marlene Zuk and Teri Orr 
examine sexual selection; and Neil Shubin unearths tetrapods and 
evolutionary steps in the fossil record. 


Animal Rights: What Everyone Needs to Know 

Paul Waldau OXFORD UNIVERSITY PRESS 256 pp. $16.95 (2011) 
Animal-rights theory has a long history, as legal scholar Paul 
Waldau describes in this primer. Setting out the basics of animal 
protection law, he explains our different attitudes to pets and wild 
animals, research and work animals, and the creatures we eat. He 
chronicles how our understanding of animal welfare has developed 
and how it has led to protective measures and legislation as 

well as a passionate — and often controversial — animal-rights 
movement. He ends with proposals for a more unified framework 
for animal rights. 


The Sublime Engine: A Biography of the Human Heart 

Stephen Amidon and Thomas Amidon RODALE Books 224 pp. 

$24.99 (2011) 

The human heart is a fundamental organ on which we rely for our 
life force. For millennia, it has also been imbued with symbolism 
across many cultures. In their biography of the heart, writer Stephen 
Amidon and cardiologist Thomas Amidon trace the influence of the 
body’s central pump through history, science, religion, literature and 
popular culture. From ancient Egypt and Greece, through the Middle 
Ages to the modern era, they ponder the miracle that is encased 
within our ribs. 


Haldane, Mayr, and Beanbag Genetics 

Krishna Dronamraju OXFORD UNIVERSITY PRESS 296 pp. $34.95 (2011) 
In the mid-twentieth century, two great biologists — J. B. S. Haldane 
and Ernst Mayr — clashed about the value of mathematical theories 
to evolution. Mayr, in 1959, queried Haldane’s ‘beanbag’ approach 
to genetics, which portrayed evolution merely as “the adding of 
certain beans to a beanbag and the withdrawing of others”. Haldane 
refuted Mayr’s position in a witty essay in 1964. Geneticist Krishna 
Dronamraju relates their vigorous exchange through the scientists’ 
correspondence. 
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Delia Derbyshire, among other electronic-music pioneers, started out in the BBC Radiophonic Workshop. 


Pioneers of sound 


Two books chart the laboratory origins of avant-garde 
electronic music, finds Marc Weidenbaum. 


fter decades of sonic experimenta- 
A two maverick institutions met 

their ends within years of each other. 
In 1992, US composer John Cage, a prolific 
pioneer of electronic music, passed away four 
weeks shy of turning 80. And in 1998, the BBC 
Radiophonic Workshop closed down after 
40 years of developing high-tech sounds. Two 
books — Kenneth Silverman’ biography of 
Cage, Begin Again, and Louis Niebur’s account 
of the Radiophonic Workshop, Special Sound 
— describe the technical innovations of these 
institutions and suggest that they were victims 
of their own notoriety. 

Created in 1958, the BBC Radiophonic 
Workshop produced electronic sounds for 
radio and television franchises, including 
landmark science-fiction series such as 
Quatermass, The Hitchhiker’s Guide to the 
Galaxy and, most enduringly, the signa- 
ture tune for Doctor Who. Founded with 
advanced electronics equipment but a mod- 
est budget, the studio’s edge was its ability to 
tinker within its means. It released some of its 


Special Sound: The Creation and Legacy of 
the BBC Radiophonic Workshop 

LOUIS NIEBUR 

Oxford University Press: 2010. 272 pp. 

$27.95, £17.99 


Begin Again: A Biography of John Cage 
KENNETH SILVERMAN 
Knopf: 2010. 496 pp. $40, £27.95 


experiments to the public in pamphlets that 
included instructions and wiring diagrams. 
Eventually, electronics became so affordable 
that freelance composers undercut BBC 
economics. Alien noises, manipulated 
recordings and synthesized instrumenta- 
tion were replicated throughout pop culture, 
diminishing the workshop’s individuality. 
The Radiophonic studio had a lab-coat 
reputation from the start. The small team 
innovated instinctively, adopting new tech- 
nologies such as voltage-controlled wave- 
form synthesis; adapting existing ones such 
as the turntable; and developing new sounds 
and approaches to enlivening narratives 
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with audio. It owed its existence not to the 
BBC music department, which baulked at 
electronics, but to the drama and features 
departments, which supported the creation 
of a team to expand the sonic palette of their 
productions. The name radiophonic was 
chosen over another term, electrophonic, 
which was deemed to be too closely associ- 
ated with brain research. 

Born in Los Angeles in 1912, John Cage 
learned to tinker from his father, a serial 
inventor of everything from submarines to an 
‘Invisible Ray Vision System. From an early 
age, Cage was a good orator and had a media- 
genic quality. Once he left home, his life took 
ona Zelig-like fluidity: boarding with art col- 
lector Peggy Guggenheim, befriending author 
John Steinbeck and studying with composer 
Arnold Schoenberg. In his later life, Silver- 
man writes, Cage was so celebrated that the 
festivities probably contributed to his death. 
His birthday milestones involved exhausting 
global galas that curtailed his composing. 

Written for ensembles large and small, 
many of Cage’s compositions invoked chance 
and interactivity. He championed percussion 
as an instrument in Western music and devel- 
oped the ‘happening’ (the event as art) and 
the ‘prepared piano (in which objects rattle 
on its strings). Today, Cage is best known for 
the silent piece less than five minutes long 
that contains no notes: 4°33” was first played 
in 1952 and was inspired by his experience 
in an anechoic chamber. His technological 
experimentation was advanced for its time — 
Cage's early 1950s tape-splicing techniques, 
for example, did not reach their full potential 
until hip-hop samplers adopted them in the 
late 1970s and 1980s. 

Many scientists were among Cage’s close 
associates. Johan Wilhelm ‘Billy’ Kliiver, a 
Bell Laboratories physicist, worked with 
him to invent a photoelectric system that 
allowed dancers to trigger sounds. Bell 
Labs employee Max Mathews — later of 
Stanford University, and after whom the 
widely used electronic music software 
Max/MSP was affectionately named — built 
Cage a 50-channel mixing board for Leonard 
Bernstein's performance of his Aflas Eclipti- 
calis. And Lejaren Hiller, a former DuPont 
chemist, helped Cage to become computer 
literate in 1967, allowing him to accomplish 
long-envisioned projects that he had previ- 
ously deemed too complex to achieve. 

Begin Again — the title of which reflects 
the continual refreshment of Cage’s ideas 
through his conversations with these many 
players — is more descriptive than critical. 
Silverman argues for the intensity of thought 
and emotion captured 
in Cage’s body of 
work. Known for his 
philosophical mus- 
ings on nature (he was 
an avid mycologist), as 


> NATURE.COM 

An interview with Tod 
Machover, the man 
behind Guitar Hero: 
go.nature.com/34hf6n 


well as on chance, technology and other 
subjects, Cage became both a public intel- 
lectual and a celebrity. 

The BBC Radiophonic Workshop also 
regularly replenished its perspective — 
by rotating its personnel. At first, this was 
a deliberate strategy by BBC managers. 
Electronic audio was so unfamiliar when 
the studio formed in 1958 that managers 
felt engineers could only work on sound 
effects “for a limited amount of time 
before succumbing to mental instabil- 
ity”. Niebur reveals the bureaucracy and 
political manoeuvrings inherent in the 
government-funded behemoth that is 
the BBC. Later, many studio hands left 
for better-paid jobs at private companies 
— among them such seminal individuals 
in the history of electronic music as Delia 
Derbyshire and Daphne Oram, the latter 
of whom went on to influence film and 
opera and develop new instruments. 

There are parallels between the two 
books. The Radiophonic faithfully served 
the BBC, and Cage composed reams of 
music for his lover Merce Cunningham's 
dance company. Both institutions pio- 
neered tape loops and other techniques, 
and each associated closely with natu- 

ralists — Cage 


“Managers felt —_ idolized philo- 
engineerscould _ gopher Henry 
only work on David Thoreau, 
sound effects ‘for and the Radio- 
alimitedamount phonic team 
of time before wrote ardently 
succumbing for wildlife 
to mental broadcaster 


David Atten- 
borough. Both 
performed for a global audience, yet their 
paths rarely crossed. 

Differences emerge in their philoso- 
phies. Cage was intellectually highbrow, 
but the Radiophonic members were 
chartered populists. Cage loomed large in 
European universities and cultural insti- 
tutions in Germany, France and Italy. The 
Radiophonic, although equally inventive, 
defined itself by remaining opposed to 
such academic forces. It avoided ideo- 
logical battles about the political nature 
of the orchestra or about musical struc- 
tures such as 12-tone serialism, in which 
Cage participated. The Radiophonic was, 
Niebur says, “the first studio in the world 
to make electronic music accessible to 
ordinary people”. Its legacy lives on in 
the latest return to our screens of Doctor 
Who — featuring yet another rendition 
of that indelible theme tune. m 


instability’.” 


Marc Weidenbaum is a writer based in 
San Francisco, California, and blogs at 
http://disquiet.com. 

e-mail: marc@disquiet.com 


The good life 


BOOKS & ARTS | COMMENT | 


Pascal Boyer assesses what science has to say about morals. 


eighteenth century that one cannot 

derive an ‘ought’ from an ‘is: In The 
Moral Landscape, journalist Sam Harris 
counters this view, arguing that science can 
shed light on why we hold moral values and 
also say which values are valid. In doing so, 
he eloquently counters the jaded pessimists 
who think that science has little to say about 
happiness. His thesis is compelling, but he 
underplays the extent to which our decisions 
are rooted in intuition, preferring to portray 
decision-making as a calculated maximiza- 
tion of our well-being. 

The psychology of morality is a rapidly 
growing field. Investigations of brain proc- 
esses involved in moral intuitions, feelings 
and judgements reveal a suite of sophisticated 
mental capacities, fine-tuned by natural selec- 
tion, that orchestrate our reactions to other 
people's behaviours and to our own. In dis- 
tributing resources, for example, people are 
sensitive to differences in need and merit as 
well as welfare. In most circumstances, people 
experience moral judgements as gut feelings, 
produced by largely unconscious processes. 
Our moral reasoning is thus an awkward 
attempt to rationalize these intuitive feelings. 

This nuanced scientific perspective is far 
from the hackneyed picture of virtue being 
a culturally derived imprint, imposed for the 
benefit of society on an instinctive beast. It 
shows how moral sentiment may have played 
a part in our evolution, conferring fitness 
advantages on ethically inclined individuals. 
But Harris goes further, arguing that humans 
can intuitively distinguish ‘the good life — 
which maximizes individual well-being — 
from less-optimal alternatives. Moral courses 
of action, he suggests, are those that lead us 
closer to this positive ideal. Scientific research 
addresses morality by explaining how our 
actions contribute to our well-being. 

Harris runs against the intellectual grain 
in stating that moral choices can be set on an 
objective scale — those that increase well- 
being versus those that do not — and that 
societies are capable of moral progress, such 
as outlawing slavery or torture. Both views 
oppose the prevailing position of moral rela- 
tivism, which supposes that the suffering is 
justifiable if it fits some local custom. Harris 
punctures such incoherent philosophy easily. 

If morals are fluid, what should be our yard- 
stick of valid choices? Religion is ruled out. 
Harris’ brand of consequentialism — the ends 
justify the means, so what is good is what max- 
imizes well-being — excludes transcendent 


Poreen: David Hume wrote in the 


sources. Because there 
is a quantifiable out- 
come, we do not needa 
deity to know whether 
our choice had good 
or bad consequences. 
Harris points out that 
most of the choices 
that are spread in the 
name of religion, such 


The Moral as proscribed killings, 
Landscape: How stem from intuition 
Science Can and not from the doc- 
Determine Human  tyine followed. Moral 
Values eee: ee 

SAM HARRIS intuition and scientific 
The Free Press: 2010. _ findings are the main 
304 pp. $26.99 bases for making moral 

decisions. 


But science and intuition are not always 
reliable. The case against mugging people is 
easy to make, by quantifying suffering and 
benefit and the net result on the level of well- 
being. But an issue such as abortion is more 
difficult: our feelings are grounded in our 
intuition about whether a fetus is a person. 
Our idea of personhood has evolved as an 
approximation that is sufficient in our social 
interactions. Such ideas typically founder 
on limiting cases — we simply cannot know 
whether a brain-dead individual or an unborn 
fetus is a person or not. Harris sees such 
conundrums as difficult, unsolved scientific 
puzzles that we should pursue. 

A moral optimist, Harris suggests that 
people can be persuaded to abandon harm- 
ful behaviours, such as the stoning of adul- 
terers. Here, social scientists may feel that he 
rides roughshod over some solid findings of 
moral psychology. Consequentialism is not 
the heuristic of most humans. Experiments 
show that assessments of well-being are of less 
importance in moral decision-making than 
a gut feeling that actions are wrong or right. 
For example, beyond its genetic risks, people 
maintain that sibling incest is wrong, even in 
cases where no children result. 

To be persuaded that some actions are 
immoral because they diminish well-being, 
people need to accept that welfare is the most 
relevant criterion of morality, which may 
require a special education. This and many 
other difficulties stand in the way of Harris's 
moral reforms, but they are all reasons to read 
his lucid, deep and uncompromising essay. = 


Pascal Boyer is Henry Luce Professor 
of Individual and Collective Memory at 
Washington University, St Louis, USA. 
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The documentary Project Nim charts an effort by Laura-Ann Petitto and others in the 1970s to teach sign language to a chimpanzee. 


Science shines at Sundance 


Relationships and behaviour are highlighted in this year’s clutch of science films at 
the agenda-setting festival, notes Jascha Hoffman. 


he Sundance Film Festival is known 

| for giving Quentin Tarantino his big 

break as a director, and for discover- 

ing low-budget box-office hits such as The 

Blair Witch Project (1999). This year, the 

stars will share the screen with a Charles 

Darwin impersonator and an expressive 
chimpanzee. 

The annual film festival in Park City, Utah, 
which this year runs 20-30 January, sets the 
agenda for independent cinema worldwide. 
Established to promote US film-makers 
working outside Hollywood, its popularity 
has grown and it now attracts big names. As 
in recent years, several of them have turned 
their cameras on scientific themes. 

One point of focus is the relationship 
between humans and animals. James Marsh, 
the Oscar-winning director of Man on Wire 
(2008), presents his new documentary 
Project Nim, which charts the training of a 
chimpanzee in the 1970s to use hand signals. 
Nim Chimpsky, as the ape was called after 
linguist Noam Chomsky, was thought by 
some to have used its own syntax. The film 
lays scientific controversy bare by reveal- 
ing how others, including the study’s leader 
Herbert Terrace, were more sceptical. 

Animal behaviour gets a comic twist in 
a short film by actor Isabella Rossellini. 
Animals Distract Me tracks a day in the life 
of Rossellini, best known for her role in 
David Lynch's film Blue Velvet (1986), as she 
encounters urban beasts across New York. 
The film follows on from her popular 2008 
series Green Porno and its 2010 sequel Seduce 
Me, in which she plays out the mating habits 
of insects and sea creatures. Further insights 


into evolution come 
from a figure playing 
Charles Darwin, who 
pops up throughout. 

Technology forms 
another theme. Ridley Scott, who directed 
Bladerunner (1982), and Kevin McDonald, 
director of The Last King of Scotland (2006), 
will reveal at the festival the result of their 
project to generate a film entirely from ama- 
teur YouTube footage. Life in a Day is crafted 
from video clips of people’s lives that were 
gathered on 24 July 2010. 

In Connected, award-winning film-maker 
Tiffany Shlain, also the founder of the Webby 
Awards for Internet excellence, explores the 
global connections that have been created 
between people thanks to the Internet. Against 
the backdrop of the death during filming of 
her father, Leonard Shlain — surgeon and 
author of Art and Physics (William Morrow, 
1993) — she asks how texts and tweets are 
changing our lives and relationships. 

Twenty-first-century film technology is 
applied to dramatic visual effect in the fan- 
tastical Polish-Swedish film The Mill and 
the Cross, starring Rutger Hauer, Charlotte 
Rampling and Michael York. Sophisticated 
digital layering and colouring techniques 
bring to life scenes based on an oil painting 
of an old master, Pieter Bruegel’s 1564 The 
Procession to Calvary. The artist and other 
period characters appear on screen as if they 
have stepped off the canvas. 

Scientific lives lie at the heart of other 
poignant films. The Music Never Stopped, 
based on a case study by neurologist 
Oliver Sacks, depicts a young man with 
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amnesia who recovers some of his capacity for 
memory when exposed to music. In 
Letters from the Big Man, a hydrologist is 
faced with a dilemma about whether to 
report her sighting of an ape-like creature 
in the Pacific Northwest. And in HERE, a 
US cartographer confronts uncertainty, in 
both map-making and love, as he works on 
a satellite survey of Armenia. 

Two noteworthy films, co-written by and 
starring newcomer Brit Marling, bring out 
the subtle side of science fiction. In Another 
Earth, a duplicate planet to our own appears 
in the sky, offering a pair of strangers the 
chance to rewrite history on that parallel 
world. In Sound of My Voice, a couple infil- 
trates a cult surrounding a woman who 
claims to have travelled from the future. 
Marling’s films recall another nuanced work, 
Obselidia (2010), about a man compiling an 
encyclopedia of the obsolete, which won last 
year’s Alfred P. Sloan prize for feature films 
about science and technology at the festival. 

With such variety on show, this year’s 
Sloan award judging panel — which 
includes anthropologist Helen Fisher, 
astrophysicist Sean Carroll and neuro- 
scientist David Poeppel — will have to pool 
their experience to pick a winner. “There is 
no all-encompassing theme,” says festival 
panel organizer John Nein of the science 
and nature films. “The strongest common- 
ality is that they offer reflections on what it 
means to be human.” = 


Jascha Hoffman is a writer based in 
San Francisco, California. 
e-mail: jascha@jaschahoffman.com 
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ORRESPONDENCE 


Will foreign-aid 
pledges materialize? 


Much has been made of 

the funding to support 
clean-energy and climate- 
adaptation initiatives in 
developing countries pledged 
by industrialized countries 

at the 2010 United Nations 
Climate Change Conference in 
Cancun, Mexico (Nature 468, 
875-876; 2010). Yet if history 
is any guide, it is questionable 
whether these funds will 
materialize. 

Nine years ago in Monterrey, 
Mexico, the same industrialized 
nations reaffirmed their 
commitment to allocate 0.7% 
of gross national income 
for official development 
assistance (ODA). This target 
was originally mandated bya 
1970 UN general resolution 
that has never been honoured. 
The ODA peaked in 1982, 
when it reached just 0.36% of 
the combined gross national 
income of countries within the 
Organisation for Economic 
Cooperation and Development 
(OECD). 

In 2009, that proportion 
was languishing at 0.31%. This 
shortfall in annual funding 
amounted to US$155 billion, 
which is substantially more 
than the Cancun pledge of 
$100 billion by 2020. If one 
assumes an average annual 
growth rate of 2% for OECD 
economies, by 2020 a further 
$68 billion of the ODA will 
still be ‘owed’ to developing 
countries. 

Even if the Cancun pledges 
are realized by 2020, they will 
offset only half of the deficit 
in the long-promised ODA. 
Without a more generous 
and decisive attitude from 
industrialized countries and 
legally binding commitments 
backed up by sanctions, the 
Cancun pledge will do little to 
raise the hopes of the world’s 
two billion poorest people. 


Mike Hulme University of East 
Anglia, UK. 
m.hulme@uea.ac.uk 


Innovation: venture 
capital is vital too 


Technology-transfer policies 
stemming from the Bayh-Dole 
Act (Nature 468, 755-756; 2010) 
may not always help emerging 
markets. They risk alienating 
the venture capitalists needed 
for commercializing early-stage 
innovation in places where 
governments control patents for 
taxpayer-funded research. 

For example, export controls 
tethered to national technology- 
transfer policy can prevent 
venture capitalists from selling 
investments abroad, hindering 
the growth of a domestic 
industry. Ironically, under- 
developed local capital markets 
discourage capital flow for 
early-stage technology because 
of poor financial returns. This 
diminishes late-stage funding 
from banks and multinationals, 
thereby strangling the entire 
innovation system. 

South Africa's nascent 
biotechnology industry has 
been a victim of such policies 
at the hands of the Technology 
Innovation Agency (TIA) — 
the last remaining funder for 
commercialization. The TIA 
now has an annual budget of 
just US$60 million or so to 
cover health, manufacturing, 
agriculture, mining, information 
technology and industrial 
biotech. BioVentures, the only 
life-sciences fund in sub- 
Saharan Africa, ranks in the 
top tenth of funds globally. It 
returns roughly three times 
the capital invested in it and 
is responsible for several 
successful domestic start-ups. 
But export controls meant 
that it was unable to raise a 
follow-up fund. 

Venture capitalists also act 
as global intermediaries in 


collecting the best research 
and development technologies. 
Even with sufficient 
capital, government-run 
commercialization funders 
can crowd out private-sector 
investment and provide less- 
effective mentoring. This is 
reflected in losses of around 
60 cents per dollar of biotech 
investment by Canadian 
labour-sponsored funds — a 
major source of venture capital. 
Successful innovation 
requires both government 
and venture-capital financing. 
Governments must be patient 
and strike the right balance 
between overall economic and 
domestic innovation. 
Justin Chakma University of 
Toronto, Canada. 
justin.chakma@utoronto.ca 
Stephen M. Sammut University 
of Pennsylvania and Burrill & 
Company, USA. 


Cutting random 
funding decisions 


We find that for one-third of 
grant applications in 2009 to 
Australia’s National Health and 
Medical Research Council, 
success is random owing to 
variability among peer reviewers. 
Increased competition for 
restricted research budgets 
means we must rectify this 
element of chance in selection. 

A quota limiting the number 
of proposals per applicant 
would thwart researchers who 
have a high success rate, while 
improving the odds for others. 
Barring unsuccessful applicants 
for one ‘cooling-off’ round 
is another idea (Nature 464, 
474-475; 2010). 

Simplifying the application 
process would reduce costs 
for both applicants and peer 
reviewers (for example, some 
funding agencies request 
superfluous information). It 
would help in recruiting good 
peer reviewers and cut the 


likelihood of random decisions. 


Funding for projects could 
be retrospective, as in some UK 
research institutions (D. F. Ball 
and J. Butler R&D Mgmt 34, 
87-97; 2004). Researchers would 
complete their research before 
applying, and then use the award 
for their next project. 
Nicholas Graves, Adrian G. 
Barnett Queensland University of 
Technology, Australia. 
n.graves@qut.edu.au 
Philip Clarke The University of 
Sydney, Australia. 


Bioethanol’s dirty 
footprint in Brazil 


We believe that insufficient 
attention is paid to the social and 
environmental costs incurred 

at regional scales by biofuel 
production in Brazil (A. K. 
Duailibe Nature 468, 1041; 2010). 

Brazil's Alagoas state covers 
almost 28,000 square kilometres, 
roughly half of which used to be 
rainforest. Sugar-cane plantations 
have now taken over coastal 
regions, including flood plains. 

A study commissioned by the 
Alagoas government reveals that 
just 13.1% of the state's original 
rainforest has survived 35 
years of the sugar-cane ethanol 
programme. This amounts to 
an average loss of 3,736 hectares 
of rainforest per year in what 
was formerly one of the world’s 
34 biodiversity hotspots. This 
environmental catastrophe is 
already taking its toll. Heavy 
rainfall in the region last year led 
to severe floods that destroyed 
thousands of buildings. 

Despite academic and political 
controversy, most people believe 
biofuels to be ‘clear’ In fact, 
ethanol production leaves a dirty 
footprint in one of Brazil’s poorest 
states. 

Lindemberg Medeiros de 
Araujo, Flavia de Barros Prado 
Moura Universidade Federal de 
Alagoas, Brazil. 
lindemberg@pq.cnpq. br 
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OBITUARY 


John Fenn 


(1917-2010) 


Chemist who enabled mass spectrometry to weigh up biology. 


in Chemistry with Koichi Tanaka and 

Kurt Wiithrich for his development of 
electrospray ionization, which, coupled 
with mass spectrometry, revolutionized 
the identification and structural analyses 
of large biological molecules. With charac- 
teristic wit, Fenn described the discovery as 
having given “wings to molecular elephants”. 
He died on 10 December 2010, aged 93, fol- 
lowing a fall. 

Electrospray ionization enables intact 
proteins to transition effectively from the 
solution to the gas phase. By applying a high 
voltage (2-3 kilovolts) to a fine needle, an 
aerosol spray is produced that turns a protein 
solution into droplets in the submicrometre 
range (0.1-0.4 micrometres). From these 
fine droplets, protein ions of varied 
charges are formed. This enables 
protein masses to be determined 
from the series of mass-to-charge 
peaks in the mass spectrum. 
Many ions with different charges 
enable many measurements, 
so phenomenal accuracy can 
be achieved in the mass deter- 
mined for the intact protein. 

This development opened 
up new areas of research in the 
1990s, including proteomics. 
For example, the molecular mass 
of a 17-kilodalton protein, such 
as myoglobin, could be measured 
with an error ofless than 0.01%. This 
was not possible prior to electrospray 
because existing ionization methods 
were incapable of producing multiply- 
charged protein ions. In a 1989 paper in 
Science, Fenn and his co-authors reported 
obtaining spectra for biopolymers “including 
oligonucleotides and proteins, the latter hav- 
ing molecular weights up to 130,000, with as 
yet no evidence of an upper limit” This turned 
out to be a prophetic statement. 

More than 20 years on, it is now possible 
to obtain well-resolved electrospray mass 
spectra of intact ribosomes and viruses, with 
molecular masses of several million daltons. 
At the time of the initial reports it was not 
clear whether non-covalent interactions 
could be preserved within these gas-phase 
protein complexes, let alone any of their 
three-dimensional structure. It is now clear 
that protein-protein interactions both sur- 
vive the phase transition and retain aspects 
of their native structure. As a consequence, 


Jin: Fenn shared the 2002 Nobel Prize 


electrospray has spawned another unex- 
pected field: gas-phase structural biology. 

Fenn was born into a middle-class 
family in New York City in 1917, later moving 
with his family to Kentucky. He received his 
bachelor’s degree in chemistry in 1937 from 
Berea College. The assistant registrar at Berea, 
Margaret Wilson, was ten years his senior; he 
married her in 1939. With Margaret provid- 
ing the ‘fellowship, in 1940 Fenn received his 
PhD in physical chemistry from Yale Univer- 
sity in New Haven, Connecticut, for work on 
the properties of electrolyte solutions. 


After periods at industrial chemical 
companies, including Monsanto, Fenn 
joined a start-up company, Experiments, 
Inc., in 1945. He contributed to a large 
effort by the US Navy to develop a ramjet- 
powered anti-aircraft missile. The missile 
depended on pressure from high-speed 
flight to compress air enough to allow it to 
expand after combustion to provide thrust. 
Supersonic flight velocities were required to 
achieve sufficient thrust — igniting Fenn’s 
interest in supersonic flows. Continuing 
this research, he moved in 1952 to Prince- 
ton University in New Jersey as director of 
Project SQUID, a jet-propulsion programme 


300 | NATURE | VOL 469 | 20 JANUARY 2011 


© 2011 Macmillan Publishers Limited. All rights reserved 


funded by the US Office of Naval Research. 

In 1962 John returned to Yale. Here he 
remained until 1987, doing the majority of 
his Nobel-prizewinning work. In the 1960s, 
Fenn was interested in combustion, flame 
theory and stabilizing flames in high-speed 
flows. He decided that the way to study 
flame reactions was to do molecular-beam 
experiments. By the 1970s, he had realized 
that one way to get bigger molecules into the 
gas phase was to take a solution, disperse it, 
and let the solvent evaporate. So began his 
earliest electrospray experiments. 

Meanwhile, he had to fight offa mandated 
move to a smaller laboratory space and 
efforts to retire him as he passed 70. He 
eventually conceded his space and posi- 

tion and, after a short spell as an emeritus 
professor, moved to Virginia Com- 
monwealth University as a research 
professor in 1994. He believed that 
science should above all be fun and 
that when it ceases to be so you 
should give up. Fenn never gave 
up, continuing his research and 
coming into the department 

almost every day until just a 

few weeks before his death. His 

final paper on the mechanism 
of electrospray was published 

when he was 90. 

In 2005, Fenn lost a dispute 
with Yale over the patent rights 
to electrospray. He claimed that 
Yale had abandoned the tech- 
nique and that he was the person 
who saved it. A personal patent filed 
by Fenn was deemed ‘civil theft’ by law- 
yers for the university. After a decade of 
bitter dispute, and a failed appeal, Fenn was 
ordered to pay costs and damages totalling 
more than US$1 million. 

Electrospray mass spectrometry continues 
to open doors — even in preparative mass 
spectrometry, where proteins and their com- 
plexes can be recovered and visualized after 
their flight in the gas phase. But Fenn’s true 
legacy is the modesty and capacity to inspire 
that leave a lasting memory in all who had the 
pleasure to meet him. = 


Carol V. Robinson, who knew John Fenn 
as a mentor and friend, is a Royal Society 
research professor at the University of 
Oxford Physical and Theoretical Chemistry 
Laboratory, Oxford OX1 3QZ, UK. 

e-mail: carol.robinson@chem.ox.ac.uk 
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NEWS & VIEWS 


FORUM Financial systems 
Ecology and economics 


A growing body of literature deals with the application of theories developed in other disciplines to financial institutions, 
to which a paper in this issue now adds. As outlined here, however, views differ as to its relevance. SEE PERSPECTIVE P.351 


THE PAPER IN BRIEF 

@ The paper’ is entitled ‘Systemic risk in 
banking ecosystems’ and is co-written by an 
expert in banking and an expert in theoretical 
ecology and science policy. 

@ It was prompted by events underlying 

the international financial crisis that began 
in 2007. 

@ It focuses on the network dynamics of 
financial institutions and, in particular, on 
the influence of the pricing of ‘derivatives’. 

@ Derivatives are financial instruments 

that have become fiendishly complex, and 


Proposing policy 
by analogy is 
risky 


paper plane is a wonderful toy model with 
which to explain how real planes fly, and 
water flow is a great analogy for teaching about 
electrical flow through circuits. But without 
business-class seats, a paper plane can never 
be used to explain why two people pay vastly 
different prices for the same flight. Likewise, 
nobody unplugs a television to get a glass 
of water. 
By cross-checking against our everyday 
experience and intuition, we can quickly 
see the limitations of such a toy model and 


that allow investment houses to hedge 
against, and bet on, price movements 

of commodities, bonds, shares and 
currencies without needing to hold the 
underlying asset. 

@ The authors apply models from ecology 
and epidemiology to explore, by a simplified 
‘toy model’ analogy, how an initial bank failure 
can propagate through such institutions. 

@ They offer suggestions on how overall 
system stability can be achieved while 
ensuring that individual banks can maintain 
their necessary economic functions. 


analogy. However, when it comes to the com- 
plexities of the financial sector, our intuition 
(and arguably that of many financial experts) 
is so limited that rigorous statistical validation 
of any toy model or analogy is essential before 
policies are suggested. This is the potentially 
dangerous shortcoming of Haldane and May’s 
paper’. In models of complex systems and net- 
works, tiny changes in the model's assumptions 
— or changes in what it means to be a node, a 
link or ‘infectious’ — can inadvertently invert 
the emergent dynamics, for example by turn- 
ing a stable output into an unstable one. Such 
changes can therefore amplify the inherent risk 
in any resulting policy suggestions. 

There is already substantial consensus that 
policy-makers need to embrace financial- 
market risk within the framework of complex 
dynamic systems”’. However, markets contain 
many heterogeneous objects, the interactions 
of which may change in any number of ways 
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in the blink of an eye (or the click of a mouse). 
This new dynamic regime, in which the char- 
acter of both the links and nodes can change 
on the same timescale*”, lies well beyond 
standard models of ecological food webs, dis- 
ease spreading and networks. The resulting 
dynamic interplay can generate unexpectedly 
large market fluctuations — and it is these 
that invalidate the financial industry’s existing 
approach to the pricing of financial derivatives 
and the management of risk””. 

The financial model® borrowed by Haldane 
and May is an interesting, abstract, complex- 
systems toy model. However, even the model’s 
original creators° emphasized that “In order for 
these kind of models to be more realistic, some 
improvements certainly are needed” They state 
that their focus was on “theoretical concepts” 
whose “relevance for real markets requires 
quantitative estimates of the parameters. Given 
the abstract nature of the model, this appears 
to be a non-trivial task.” They are absolutely 
right. Would you fly in a paper plane that had 
been scaled up to the size of a 747? 

Policy-makers may never fully appreciate a 
model's limitations, so policy suggestions are 
potentially dangerous unless accompanied 
by a quantified health warning of a model’s 
robustness and underlying assumptions, based 
on rigorous statistical testing against state- 
of-the-art financial data sets. Otherwise we 
simply increase risk, rather than reduce it. 
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C. GILLON/GETTY 


Network theory 
is sorely 
required 

THOMAS LUX 


Ha and May’ argue that models from 
ecosystem research can offer valid insight 
for understanding the financial sector. But 
is this too far-fetched an analogy? Can one 
really imagine the regulation of financial 
markets being based on their similarities 
to networks such as food webs? My answer, 
in a sense, is ‘yes’ — we should take these 
similarities seriously. 

This is not to say that we should equate 
banks, and their depositors and hedge funds, 
with some type of schematic predator-prey 
model. It is rather the potential similarities 
between the structural, system-wide prop- 
erties of these very different areas of study 
that we should be interested in. As Haldane 
and May point out, research in biology has 
arrived at quite clear-cut results on the deter- 
minants of the robustness and vulnerability of 
ecosystems. 

By contrast, the modelling of ‘representa- 
tive agents’ in economics has led to a delusive 
neglect of the effects of interaction between 
those agents. By focusing mainly on individ- 
ual optimization of utility or profit, economics 
has lost the perception that “more is differ- 
ent”’ — namely, that higher-level aggregates 
(for instance, the global financial system) can 
have properties that cannot be understood 


solely on the basis of their constituent units 
on a lower hierarchical level (the single bank 
or investor). Built upon this extreme form of 
reductionism, the established framework for 
bank regulation has been exclusively micro- 
oriented and has lacked any consideration of 
systemic risk factors. 

As recent history has shown, system-wide 
effects are important. The default of Lehman 
Brothers in 2008 had the contagious effects 
of a ‘super-spreader disease, and the subse- 
quent domino effect brought the entire finan- 
cial system to the verge of collapse. Systemic, 
‘macro-prudential’ regulation is now an issue 
on the political agenda, and the structure of 
the financial sector has to be scrutinized for its 
stabilizing and destabilizing feedbacks. 

However, the micro-based banking literature 
has little to say on such issues. We know from 
the natural sciences that structurally similar 
connections between micro units might lead 
to similar system behaviour in very different 
areas. It seems essential, therefore, to take 
stock of the accumulated knowledge on net- 
work structures when studying systemic risk 
in the banking sector. The few available phe- 
nomenological studies of particular segments 
of the interbank market in the ‘econophysics’ 
literature have already identified network 
structures that are known to be vulnerable to 
shocks*”. The near-collapse of the overnight 
interbank market at various stages of the recent 
crisis provides an empirical confirmation of 
how susceptible this particular structure is to 
disturbances. 

Connections between financial institu- 
tions are, however, multi-faceted, and only 
part of this complex man-made system has 


Seeing into the future 


The resting brain recapitulates activity patterns that occurred during a recent 
experience, possibly to aid long-term memory formation. Surprisingly, 
corresponding brain activity also occurs before an event happens. SEE LETTER P.397 


EDVARD I. MOSER & MAY-BRITT MOSER 


races of experience can be found in 

the activity of the sleeping brain. One 

region in which such traces are detected 
is the hippocampus, which is required for 
episodic memory in mammals. In rodents, 
for example, most hippocampal neurons fire 
selectively when the animal is in a particular 
location’”. When these neurons — called place 
cells — are active during a particular experi- 
ence, they also tend to be active during subse- 
quent sleep’. The order of firing is also often 
preserved”®. Such subsequent replay of brain 
activity also occurs in the awake state, when 


an animal rests between bouts of running”. 
The recurrence of experience-related firing is 
thought to contribute to the reorganization of 
synaptic connections between neurons during 
memory consolidation*”. 

However, Dragoi and Tonegawa” write 
on page 397 of this issue that hippocampal 
reactivation is not merely a reflection of prior 
experience. They initially recorded sequences 
of firing in place cells when mice that had been 
running on one arm of an L-shaped track were 
resting at food locations near the ends of that 
arm. As expected, place-cell firing sequences 
were replayed during pauses at the food loca- 
tions. The authors then opened the other 
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been mapped in existing toy models. Going 
beyond toy models, an empirical assessment 
of how trading in complex derivatives affects 
the network topology of the banking sector, 
and how that interacts with other linking 
factors (such as interbank credit lines), is 
urgently required”, There is, of course, also a 
need to go beyond the first step of analogies, 
and relatively simple mechanical models, to 
examine the behavioural micro-foundations 
of how the agents involved choose their 
connections in this financial ecosystem. m 
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track’s arm to allow the animals to run across 
the entire L. Surprisingly, they found that when 
the mice were resting before gaining access to 
the second arm, some of the firing sequences 
in their brain matched those subsequently 
recorded on the new arm. They refer to this 
predictive activity as preplay (Fig. 1). 

One might argue that, on exposure of the 
animal to the extended segments, preplay 
merely reflects replay of activity associated 
with the familiar first part of the maze, the 
shape of which was similar to that of the new 
part. But mice with no prior experience on 
any track also showed preplay. Together, these 
observations suggest that, in a new environ- 
ment, activity sequences involving a specific 
assembly of cells are selected from already- 
existing sequences in the network. 

So why have other investigators using similar 
experimental procedures not detected preplay? 
After all, in the most common design, the firing 
sequences that occur during a behaviour are 
compared with resting sequences both before 
and after the behaviour — just as in the present 
study. Dragoi and Tonegawa”’ propose that 
their method of comparing spike sequences 
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Figure 1 | Real dreams. When an animal is resting, sequences of neural activity in its brain resemble 
those that took place during a previous experience, suggesting that the experience is replayed. Dragoi 
and Tonegawa”” show that resting mice also ‘preplay’ activity sequences that are predictive of subsequent 


activity in environments never visited before. 


during rest with template sequences from the 
running epochs may be more sensitive than 
the pairwise correlation procedures used in 
most of the previous work. Indeed, using pair- 
wise correlation methods, these authors!’ do 
not observe preplay either. It would be inter- 
esting to see whether the authors’ template- 
based approach reveals preplay in data from 
past studies. 

Predictive firing implies that, in the hippo- 
campus, internal dynamics play a larger part 
in shaping firing sequences than has often 
been assumed. This deduction fits well with 
theoretical work'"”’ from the 1990s suggest- 
ing that the hippocampus is part of a pre- 
configured network for abstract representation 
of two-dimensional space. The metric for this 
internal map was proposed to be self-motion, 
with environment-specific information being 
added only secondarily — through learning. 

Although self-motion-dependent signals are 
now thought to originate outside the hippo- 
campus””’, this brain region may receive and 
express such signals. Consistent with this idea, 
place cells continue to fire in regular sequences 
when an animal's position is fixed, for example 
when a rat is running ina wheel’. Moreover, 
rat pups exploring an open space for the first 
time show adult-like place-cell sequences'”’’, 
which indicates that path sequences are hard- 
wired in the synaptic-connection matrix by 
either genetic programs or early experience. As 


Dragoi and Tonegawa”” show, in the adult brain 
such pre-wired sequences can be retrieved 
both during sleep and while running in a new 
environment. 

Does preplay rule out experience as a major 
determinant of sequence reactivation? The 
answer is no. Dragoi and Tonegawa report 
that, although the order of firing on the track 
resembled firing sequences both before and 
after the running session, there were roughly 
three times as many replay events as preplay 
events. This observation is in agreement with 
the previously reported enhanced correla- 
tion after a behaviour. It seems, therefore, that 
although some sequences might be hardwired, 
others are shaped by recent experience. 

The fact that replay and preplay occur 
at different times provides further support 
for mechanistic segregation. Factors that 
control when replay and preplay take place 
should be identified, as well as any interaction 
between these activites. For instance, are 
replay sequences moulded from pre-existing 
templates expressed during preplay? And, if so, 
do such preconfigured sequences contribute to 
generalization of experience between similar 
environments”? 

Dragoi and Tonegawa” show that as many 
as 15% of the spiking events that occurred dur- 
ing resting correlated significantly with sub- 
sequent sequences in the new environment. 
This relatively large number of predictive- 
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spiking events is worthy of some reflection. If 
preconfigured sequences were used as a raw 
material to encode sequences in new environ- 
ments, and each environment received a unique 
set of firing sequences, then the hippocampus 
might soon run out of templates. It is possible, 
therefore, that replay and preplay originate from 
different neuronal pools, and that the fraction 
of the two event types remains constant. 

One possibility is that replay depends on 
experience-sensitive associative networks in 
the CA3 area of the hippocampus, whereas 
preplay reflects rigid, environmentally neu- 
tral path sequences coming in from regions 
outside the hippocampus such as the medial 
entorhinal cortex””’. Such hardwired ento- 
rhinal sequences could also account for the 
presence, in hippocampal-cell ensembles, of 
activity sequences corresponding to routes 
never taken by the animal’*. Another possibil- 
ity is that preplay reflects a general tendency for 
neurons to carry over firing patterns from one 
environment to the next when the two differ- 
ent environments are visited close together in 
time. Indeed, such carry-over effects have been 
observed” in the CA1 area of the hippocampus. 
Whether predictive activity disappears at 
intervals longer than the time constant of CA1 
hysteresis should be determined. Researchers 
will undoubtedly address these possibilities, 
as well as the mechanisms and functions of 
preplay more generally. = 
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ASTROPHYSICS 


How galaxies got their 


black holes 


The massive compact objects in the centres of galaxies developed in at least two 
ways. One seems to be a natural result of galaxy formation in the Big Bang theory 
of the expanding Universe — the other is enigmatic. SEE LETTERS P.374 & P.377 


P. JAMES E. PEEBLES 


large galaxy usually has a central, com- 
A= massive object, termed a relativis- 

tic black hole for want of a better idea 
of what it is, that can produce great bursts of 
energy. When the black hole is surrounded by 
a cloud of old stars — the ‘bulge’ of the host 
galaxy — its mass is a few per cent of the mass 
of the bulge. This relationship, observed for 
black-hole masses ranging from about 1 mil- 
lion to 1,000 million solar masses, suggests that 
black holes and bulges evolved together. Other 
large, spiral-shaped galaxies have black holes 
of respectable mass (1 million to 100 million 
solar masses) and no perceptible bulge. In two 
papers in this issue, Kormendy et al.'” explore 
clues to how these bulgeless, or pure-disk, 
spirals and their black holes formed. 

Most of the mass in galaxies does not exist 
in the form of stars but in a halo of dark mat- 
ter (matter different from the hydrogen and 
heavier elements of which we and the stars are 
made). There is a substantial literature on the 
elegant idea that dark matter controls the size 
of the central black hole, but Kormendy and 
Bender’ (page 377) show that this cannot be 
so. The nearby bulgeless spiral galaxy M101 
(Fig. 1) illustrates the situation: if M101 hasa 
central black hole, its mass must be tiny relative 
to that of the black holes of other spirals with 
similar dark-matter haloes. 

Bulges and their black holes seem to be a 
natural consequence of structure formation 
in the hot Big Bang theory of the expanding 
Universe. According to this theory, galax- 
ies grew by gravitational assembly of matter 
into clumps that gathered into larger clumps, 
and so on to galaxies. In galaxies with bulges, 
including ellipticals, which have bulges and no 
disks, the mass of the central black hole cor- 
relates not only with the mass of the bulge, but 
also, as Kormendy, Bender and Cornell’ note 
(page 374), with the average spread of veloci- 
ties of the bulge stars (see Fig. 2a on page 375). 
The plausible explanation is that part of the gas 
out of which bulge stars formed settled instead 
near to the black hole, in part increasing its 
mass and in part fuelling explosions that blew 
the gas away and suppressed bulge-star forma- 
tion. That is, the growth of bulge and black hole 
may have controlled each other. The timing 


looks right. Bulge stars are old: they formed 
when the expanding Universe was roughly a 
third of its present size (redshift about 2). This 
is when the rate of star formation per unit 
of matter was near its maximum (more than 
10 times the present rate’). It is also when 
quasars — explosions powered by the central 
black holes — were most abundant (100 times 
more common than now’), probably an explo- 
sive result of overfeeding of the black holes as 
the early generations of stars were forming. 
In addition to the evidence that black holes 
and bulges co-evolve, there are indications of 
the possibly related evolution of other com- 
ponents of galaxies. The mass distribution in 
a spiral galaxy varies smoothly from its outer, 
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dark-matter-dominated parts to its inner 
parts, where the mass in stars is important. 
That is, the dark matter seems to have hada 
strong influence on the formation of the spiral 
galaxy’s disk of gas, stars and dust — but not on 
the formation of the central black hole. 

Another plausible example of co-evolution is 
the growth of a black hole and a pseudobulge, 
a concentration of starlight near the centre 
of the galaxy, but in the disk, not extending 
above it as do stars in a bulge. Pseudobulges 
may be the accumulation of disk material 
that migrated towards the galactic centre, 
some of it tumbling all the way in to feed the 
black hole’s growth. Here, however, we lack a 
signature. Kormendy et al.' find that, unlike 
the case for galaxies with bulges, for pure-disk 
galaxies with pseudobulges, such as M101, 
the properties of the disk, pseudobulge and 
black hole are not closely related. A challenge 
for the advancing power of theoretical meth- 
ods*® is to understand this inward migration 
of matter, and why it preferentially fed the 
pseudobulge in some galaxies and the black 
hole in others. 

There are roughly equal numbers of nearby 
large galaxies with and without bulges’. For 
example, the galaxy next to ours, M31, has a 
prominent bulge and a black-hole mass close 
to 100 million solar masses, whereas our 


Figure 1 | The pure-disk galaxy M101. The spiral arms in this nearby galaxy are in a rotating disk of 
stars and gas seen nearly face-on. The dark streaks are lanes of dust that absorb starlight. The inset is an 
enlarged view of the central region, and shows dust lanes extending to the tiny central star cluster. Because 
dust settles near to the disk, the absorption of starlight shows that most of the stars are close to the disk too. 
That is, this galaxy does not have the bulge of old stars extending above the disk that is a prominent feature 
of some galaxies. If there is a black hole in the centre of M101, it is tiny compared with black holes in other 
galaxies of similar mass. This exemplifies Kormendy and colleagues’ argument” that the dark-matter halo 
of a galaxy has little influence on the formation of its central black hole. The authors also point out that 
pure-disk galaxies are not uncommon, and that they managed to grow black holes without possessing the 
bulge that is thought to funnel the mass that grows the black hole of a galaxy that has a bulge. 
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Galaxy, which has a similar dark-matter halo, 
is bulgeless, and the black-hole mass is only a 
few per cent of that in M31. Therefore, the local 
environment, which was probably similar for 
these neighbouring galaxies, does not seem to 
bea significant factor in determining whether 
a galaxy acquires a bulge and its attendant 
black hole. 

In theory, galaxies both with and with- 
out bulges were growing by the gravitational 
collection of clumps of matter when the star- 
formation rate was near its peak. That would 
suggest that the clumps contained stars; a 
recent discussion puts roughly comparable 
masses in stars and gas®. So where are these 
early generations of stars? Not in disks, because 
there is nothing that would slow the motion 
of a star to allow it to settle onto a disk. Bulges 
contain old stars, and it has been suggested 
that this is where the early stars ended up. 


But we now see that this is not plausible: why 
would these old stars have avoided our bulge- 
less Galaxy and settled instead in the bulge of 
our neighbour M31? Maybe the old stars are 
in diffuse stellar haloes. If so, it seems curious 
that the stellar halo of our Galaxy is much less 
prominent than that of M31. But more stud- 
ies of other nearby galaxies will be required 
to check for inventories of stars that are old 
enough and abundant enough to account for 
stars that formed before disks. 

Our standard galaxy-formation theory is 
informed by the hot Big Bang cosmological 
model. The demanding tests that this model 
passes® show that we must take it seriously, 
but so far it has not offered much guidance 
for understanding the evidence Kormendy 
et al.'* present. And because the Universe 
has surprised us before, I would not ignore 
the possibility that the cosmological model 


Drugs that prevent 
HIV infection 


Two human trials investigate the efficacy of a type of antiretroviral drug — 
usually used to treat HIV-infected individuals — in preventing HIV infection. 


The results are heartening. 


MARK A. WAINBERG 


ach year, several million people become 
Hines with HIV — the agent respon- 

sible for AIDS. This formidable statistic 
highlights the need for drugs that could be 
given as a preventive measure to vulnerable 
populations. Such drugs would be as valuable 
as those currently used to prevent infection 
after recent possible exposure to the virus and 
to treat the symptoms of HIV infection. Two 
papers, one published in Science’ and the other 
in The New England Journal of Medicine’, now 
report on a class of antiretroviral drug that can 
prevent HIV infection in a significant propor- 
tion of individuals. Although previous animal 
studies have shown evidence for such effects*", 
the current trials provide the first proof of 
principle for the approach in humans. 

The two studies used different modes of 
drug administration to achieve protection. In 
the first, a microbicide-based approach car- 
ried out in South Africa, Abdool Karim et al.’ 
asked roughly 450 women who were at risk of 
acquiring HIV to self-administer a vaginal gel 
impregnated with the drug tenofovir before 
sexual intercourse. The authors report a 39% 
reduction in HIV transmission in these women 
over 2.5 years compared with a similar number 
of women who received a placebo gel. 
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Notably, Abdool Karim and colleagues 
point to an excellent correlation between 
adherence to the use of the tenofovir gel 
and the extent of protection. The levels of 
protection in women who used the gel at 
least 80% of the time, 50-80% of the time, or 
less than 50% of the time were 54%, 38% and 
28%, respectively. An added benefit was that a 
high proportion of the women who used the 
gel were also protected from infection by an 
unrelated virus — herpes simplex virus. 

Future work will doubtless make use of com- 
binations of antiviral drugs in vaginal gels. For 
instance, a gel based on both tenofovir and 
another antiretroviral drug, emtricitabine, 
should perform better than one based on either 
agent alone. Indeed, in the second study Grant 
et al.” investigate the efficacy of tenofovir and 
emtricitabine, in this case as a co-formulated, 
daily administered single pill. 

The researchers gave the pill, or a placebo 
pill, to nearly 2,500 sexually active homo- 
sexual men as oral pre-exposure prophylaxis 
(PREP). Over the following median period of 
1.2 years, they documented a 44% reduction 
in HIV acquisition in the experimental group 
compared with subjects receiving the placebo. 
Although Grant and colleagues did not study 
the efficacy of their pill among vulnerable 
women, similar results would be expected. 
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requires fine adjustment to account for a 
relatively small detail — the galaxies. = 
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Figure 1 | Prevention is better than cure. Once 
HIV enters a host cell, the virus’s reverse 
transcriptase enzyme converts its RNA genome 
into DNA. The viral DNA then becomes 
integrated into the cellular genome to facilitate 
HIV replication. Two clinical trials'* show that 
the drugs tenofovir and emtricitabine — which 
potently inhibit HIV reverse transcriptase, thereby 
blocking viral replication — can prevent the 
initiation of persistent infection in uninfected 
individuals. These drugs, however, cannot 
eliminate cells already infected with the virus. 


Like Abdool Karim and co-workers, Grant 
et al. note the importance of adherence to the 
drug regimen: their pharmacological data 
reveal detectable levels of both tenofovir and 
emtricitabine in only 3 (9%) of the 34 individ- 
uals who became infected, in contrast to 22 
(51%) of 43 people who remained uninfected. 


This common message of both papers is highly 
encouraging, and should help to promote 
adherence among populations at risk. 

Another common theme of the two stud- 
ies reinforces the idea that drugs that quickly 
reach high levels in the blood and other body 
compartments are more effective than those 
that do not: high levels of each of tenofovir 
and emtricitabine are maintained in the rele- 
vant body fluids and tissues for at least 24 hours. 
The idea has now prompted some investigators 
to explore the alternative approach of inter- 
mittent PREP, whereby tenofovir and emtri- 
citabine are taken only on the morning before 
the anticipated sexual intercourse, rather than 
every day. It is noteworthy that, for ethical rea- 
sons, all of these studies encourage participants 
to also use condoms, although generally many 
of the subjects do not comply. 

The antiretroviral drugs used in the current 
studies’” act on the HIV reverse transcriptase 
enzyme (Fig. 1). Further research might 
instead focus on drugs that target other mol- 
ecules involved in the virus’s life cycle; attrac- 
tive candidates include compounds that block 
HIV attachment to target cells. One such drug, 
maraviroc, which is approved for treating HIV 
infections, has been licensed for use in the 
development of microbicides, and there is a 
strong rationale for its use in oral PREP. 

It is not surprising that antiviral drugs can 
prevent sexual transmission of HIV. It has long 
been known that HIV-infected women who 
receive such drugs during pregnancy have a 
greatly reduced risk of transmitting the virus 
to their babies*. Moreover, both oral-PREP and 
microbicide studies” in monkeys have shown 
that either tenofovir or joint administration of 
tenofovir and emtricitabine can prevent infec- 
tion with the HIV-related virus SIV. What's 
more, individuals whose HIV infection is well 
managed with efficient use of antiretroviral 
drugs are relatively non-infectious to their sex- 
ual partners, even if condoms are not used®”. 

The new work’”, however, could potentially 
take us beyond the 60% success rate in prevent- 
ing HIV transmission that has been obtained 
through male circumcision”, because the latter 
procedure remains difficult to implement in 
low-income countries. The use of a pill may 
be the easiest option of all. 

Some dangers remain, nonetheless. For 
instance, if antiviral drugs — administered as 
either microbicides or oral PREP — are given 
to people who are already infected with HIV 
but not yet diagnosed, it might result in drug 
resistance. This is because, given alone, neither 
tenofovir nor co-administered tenofovir and 
emtricitabine can fully suppress viral replica- 
tion, but can potentially create an opportunity 
for selection and subsequent transmission of 
drug-resistant HIV strains"’. This issue may be 
more relevant for oral PREP than for microbi- 
cides, particularly if the gel formulations used 
for the latter can prevent the systemic absorp- 
tion of the drugs that could lead to the selection 


of resistant viruses. This potential problem 
notwithstanding, the results of these two 
trials represent marked progress in research to 
prevent HIV infection. = 
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Spot the hotspot 


Plasmonic hotspots — nanometre-sized crevices that permit the detection of single 
molecules — are too small to be imaged with conventional microscopes. They can 
now be probed using super -resolution fluorescence microscopy. SEE LETTER P.385 


MARTIN MOSKOVITS 


hen light is shone on molecules 

placed on appropriately nano- 

structured gold or silver surfaces, 
the light scattered from the molecules is often 
amplified by factors of a million — or even a 
billion. This amplification effect is the work- 
ing principle of surface-enhanced Raman 
spectroscopy (SERS)’”, and allows the tech- 
nique to detect and identify very small samples 
of molecules, often down to single molecules. 
Among researchers, there is a consensus that 
the effect arises because the nanostructured 
metals can harvest the incoming light in their 
environments — much like an antenna collects 
radio waves — and concentrate the light’s elec- 
tromagnetic energy in ‘plasmonic hotspots, 
often nanoscale clefts, gaps and fissures. How- 
ever, although the optical field within a hotspot 
has been the subject of numerous calculations 
and simulations, direct measurements of the 
field's profile have been hard to carry out. On 
page 385 of this issue, Cang et al.’ show how 
they have managed to do just that. 

The most effective hotspots — those yield- 
ing the highest light enhancement — are a 
few nanometres in diameter. However, the 
normal image resolution of a conventional 
optical microscope, which is limited to half the 
wavelength of the incident light, is tenfold too 
coarse to probe such small areas. Even uncon- 
ventional imaging devices, such as near-field 
scanning optical microscopes, are limited by 
the size of their probes, which rarely fall below 
30 nanometres’. 

Enter Cang and colleagues’, who adapted 
the technique of single-molecule, super- 
resolution optical fluorescence microscopy” 


to probe SERS hotspots. This method makes 
use of the fact that, although the diameter of 
the apparent image of any given molecule is 
much larger than the molecule (in fact, it is 
at least as large as half the wavelength of the 
light emitted by the molecule), if the individ- 
ual molecules of a molecular ensemble can be 
made to fluoresce one at a time, the diameter 
of the aggregate image becomes much smaller. 
Indeed, the technique can yield image resolu- 
tion of less than 2 nm. 

To image single hotspots on the surface of 
a metal, Cang et al. submerged the metal in a 
solution of fluorescent dye molecules, and by 
choosing the appropriate concentration they 
ensured that, on average, molecules would 
arrive at the hotspot one at a time. On bind- 
ing to the hotspot, a molecule’s fluorescence 
increases greatly and appears as a bright spot, 
the intensity of which is a measure of the 
enhancement. The adsorbed dye molecule’s 
fluorescence dies out within hundreds of milli- 
seconds of its arrival at the hotspot, and the 
hotspot is then ready for the next adsorption 
event. 

In this way, Cang and colleagues’ were able 
to image the fluorescence enhancement profile 
of single hotspots as small as 15 nm with an 
accuracy of 1-2 nm. A recent study by Strana- 
han and Willets®, which was based on an imag- 
ing technique similar to that used here, has 
also reported single-hotspot images, but with 
a resolution of the order of 10 nm. 

For hotspots in silver nanostructures, 
Cang et al. measured fluorescence-intensity 
enhancements in excess of 130. Fluorescence 
enhancements are significantly lower than 
those of SERS for two reasons. First, in fluor- 
escence only the incident field is enhanced, 
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whereas SERS enhancement derives from the 
amplification of both the incident and scattered 
fields — in fact, the SERS enhancement effect 
is really enhanced scattering of the molecules’ 
Raman emission by the metal. Second, the 
fluorescence of molecules in close proximity 
to a metal is quenched to some extent’. 

Cang et al. also demonstrated that the SERS 
enhancement effect correlates inversely with 
the size of the hotspot. That is, the optical field 
was most strongly concentrated in the small- 
est hotspots, an effect that had previously been 
predicted by simulations but not measured 
directly. Taken together with the earlier report 
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of Stranahan and Willets’, the authors’ results” 
provide direct evidence for optical-field concen- 
tration in hotspots, which has previously been 
supported by only indirect — although compel- 
ling — evidence. A current challenge for SERS 
is the engineering of substrates with a very large 
density of hotspots, which could function as 
highly sensitive biomolecular sensors. With the 
demonstrated ability to image hotspots directly, 
that goal seems within closer reach. m 
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Farming writ small 


Social slime moulds graze on bacteria, but save some for transmission in 
their spores. Strains practising this primitive form of farming coexist with 
non- farmer strains in an intriguing cost-benefit equilibrium. SEE LETTER P.393 


JACOBUS J. BOOMSMA 


domestication of crops and livestock 

enabled bands of human hunter-gatherers 
to transform into the computerized, space- 
exploring societies of today. A predictable 
food supply ultimately turned out to bea great 
benefit, but it has remained unclear how the 
initial transitions to farming prevailed in the 
face of the inevitable infrastructural costs and 
vulnerability to disease’. Societies of fungus- 
growing ants and termites, and wood-boring 
ambrosia beetles, have also evolved advanced 
agriculture’; here, too, the absence of surviv- 
ing lineages in which farming and non-farming 
co-occur make it difficult to evaluate how these 
transitions became irreversible. Brock et al. 
(page 393 of this issue)’ now add to the picture. 
Their demonstration that Dictyostelium slime 
moulds practise bacterial husbandry sheds 
light on the trade-offs governing incipient 
domestication. 

Dictyostelium exist as soil-dwelling, uni- 
cellular amoebae that can reproduce sexu- 
ally’. But they are best known for their social 
reproduction, during which cells aggregate 
and form a motile multicellular slug that 
subsequently produces a fruiting body con- 
taining asexual spores’ (Fig. 1). This happens 
whenever the amoebae have exhausted their 
local bacterial prey, on which they feed in a 
quite catholic manner’. One species — Dictyo- 
stelium discoideum — isa particularly impor- 
tant model system, both for developmental 
biologists studying the origin of multicellular- 
ity’ and for evolutionary biologists interested 
in the conflicts and cooperation that occur 
under the conditions of kin selection®. Brock 


I: the space of just a few millennia, the 


and colleagues’ show that a significant frac- 
tion of D. discoideum spores carry bacteria to 
inoculate new habitat with food, but that such 
husbandry has remained clone-specific, with 
only about one-third of the strains examined 
practising it. 

The authors’ molecular and experimental 
evidence indicates that strains are either 
farmers or non-farmers and that spore-borne 
bacteria confer substantial fitness benefits on 
Dictyostelium when spores land in patches with- 
out suitable food. However, this form of farming 
has significant costs because patches cannot be 
exhaustively grazed before cells aggregate for 
reproduction. Slugs loaded with bacteria also 


Figure 1 | Stages in the life cycle of Dictyostelium 
discoideum. Main picture: light microscope image 
(x 50) of the motile slug, an aggregation of many 
individual amoebae. Inset: a scanning electron 
micrograph (x 10) of spore towers, the asexual 
reproductive structures into which slugs form 
under starvation conditions. Brock et al.’ show 

that certain strains of D. discoideum husband their 
bacterial food by not grazing bacteria to exhaustion 
and carrying some to new sites in their spores. 
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travel less far, and investments in co-transmis- 
sion do not pay off when spores germinate in 
places where food bacteria are present. These 
context-dependent benefits make the Dictyo- 
stelium farming polymorphism an example of 
a mixed evolutionarily stable strategy’. This 
is a fascinating finding, because the Dictyo- 
stelium-bacterial symbiosis is evidently driven 
by mutualistic advantages, despite the obvious 
risks of bacterial exploitation of the dispersal 
opportunities provided by the hosts. 
Dictyostelium slime moulds belong to an 
ancient group, sister to fungi and animals 
combined’. So, despite the long evolutionary 
timescale, why have the benefits of farming 
remained such a mixed bag? It seems that what 
has stalled developments towards unambigu- 
ously profitable farming is that slime moulds 
did not become specialized on a single bacterial 
species, which could then have co-evolved and 
co-speciated with its host and lost its free-living 
stages. Slime mould lineages did adapt to farm- 
ing, but their bacterial ‘crops’ have remained 
under selection for independent growth. There 
is no substrate facilitation, growth reinforce- 
ment, or removal of competitors or weeds, as 
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seen in the classic fungus-farming symbioses” 
and in the more recently discovered incipi- 
ent practices of fungus farming by Littoraria 
snails’ and red-alga farming by Stegastes 
damselfish””. 

The damselfish mutualism has led to at least 
one case of monoculture in which the crops 
no longer have free-living relatives. The fact 
that slime mould husbandry has not achieved 
this status underlines the point that parent- 
offspring transmission is insufficient to install 
absolute co-dependency in a mutualism”. 
Some form of monoculture farming is appar- 
ently essential to make symbionts put all their 
eggs in a host's basket, because being eaten is 
profitable only when it benefits clone mates 
that are nursed and dispersed. Monocultural 
commitment makes these kin-selected ben- 
efits consistent, paving the way for mutual 
coadaptation, irrespective of symbionts being 
acquired from the environment rather than 
being inherited”’. The limitations of bacterial 
husbandry in slime moulds’ therefore clarify a 
major cornerstone of our general understand- 
ing of mutualistic interactions. They also invite 
further study to unravel the molecular mech- 
anisms that allow or prevent bacterial trans- 
mission, and to establish the dynamics of food 
transmission in slugs that are genetic mixtures 
of several strains®. 

The Dictyostelium symbiosis presents an 
interesting analogy to culturally adjustable 
human subsistence farming in its various 
contemporary and historical combinations 
with hunter-gatherer strategies’. In both slime 
moulds and humans, farmers did not become 
reproductively isolated from non-farmers, 
nor did crops or livestock lose the possibil- 
ity of hybridizing with wild relatives, as has 
happened in the specialized insect fungus- 
farming symbioses”””. The slime moulds may 
have insufficient multicellular complexity to 
evolve specialized nurturing traits for par- 
ticular crops, whereas our own species lacked 
evolutionary time and consistent selection for 
extreme crop specialization. Neither of these 
constraints applied to the farming societies of 
ants and termites. 

Although Dictyostelium do not actively rear 
their crops, they may well possess unknown 
adaptations that, if revealed, would illuminate 
fundamental questions of conflict and cooper- 
ation across species boundaries. The ancestors 
of these slime moulds were among the earliest 
colonizers of terrestrial habitats, so the history 
of this bacterial husbandry symbiosis may go 
back further than any other farming system. 
Mapping farming practices on a large-scale 
evolutionary tree of the slime moulds would 
therefore be a worthy objective. If this farm- 
ing symbiosis turns out to be ancient, our 
new understanding of Dictyostelium biology 
could be summed up in a lyric of the rock 
band Metallica. To paraphrase: wherever 
they roamed, they redefined the unknown, by 
themselves but not alone. m 
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Peptide gets in shape 
for self-defence 


The transformation of tadpole to frog and of caterpillar to butterfly are two of the 
more obvious examples of metamorphosis. But molecular shape-shifting may occur 
in each of us as part of our innate antibacterial defence system. SEE LETTER P.419 


ROBERT |. LEHRER 


mong the immune mediators that 

fight microorganisms within us, one 

is human §-defensin 1 (hBD-1). This 
peptide, which was first described’ in 1995, 
is continually expressed in skin and epithelial 
cells throughout the body’. But because its 
direct antibacterial properties are modest, the 
reason it stands guard at interfaces between 
microbe-laden environments — such as the 
colon or skin — and their adjacent, normally 
sterile tissues has remained enigmatic. On 
page 419 of this issue, Schroeder et al.’ show 
that the mild antibacterial activity of hBD-1 
changes drastically after it undergoes a chemi- 
cally induced change in shape. 

The peptide backbone of hBD-1 is folded 
into a well-defined structure that is held 
together by three internal disulphide bonds’. 
Schroeder et al. find that an enzyme called 
thioredoxin reductase can sever these disul- 
phide bonds in a reduction reaction. The 
reduced hBD-1 molecule undergoes a pro- 
found change in shape (Fig. 1, overleaf) that 
allows it to kill some Gram-positive bacteria, 
against which its normally oxidized form is 
powerless. The authors’ electron micrographs 
of the slain bacteria show changes that might 
lead a forensically inclined microbiologist to 
wonder whether reduced hBD-1 induced the 
bacteria to self-destruct, by triggering their 
latent autolysis (self-breakdown) systems’. 

Schroeder and co-workers’ observations” 
are unlikely to be merely a ‘test-tube’ phenom- 
enon. They demonstrate that in human epithe- 
lia, oxidized hBD-1 and thioredoxin reductase 
colocalize with reduced hBD-1. 


Ifshape change alone imparted the expanded 
antimicrobial range of hBD-1, then analogues 
of this peptide in which cysteine residues are 
replaced by other amino acids should also 
show enhanced function, because the cysteine- 
free peptides would be unable to form shape- 
restraining disulphide bonds. But Schroeder 
et al. report that such analogues are ineffective 
antibiotics. Evidently, there is something spe- 
cial about the cysteine residues of hBD-1, but 
exactly what remains unknown. 

There is also something special about 
having a positive charge. Full-length hBD-1 has 
36 amino-acid residues, including six cysteines, 
one negatively charged aspartic acid and five 
positively charged residues (four lysines and 
one arginine). If the partial charge of its single 
histidine residue is ignored, hBD-1 has a net 
charge of +4. This charge is concentrated in 
its carboxy-terminal octapeptide — arginine- 
glycine-lysine-alanine-lysine-cysteine-cysteine- 
lysine. A truncated variant ofhBD-1 lacking the 
last seven of these residues shows no activity, 
but a seven-residue peptide lacking only the 
initial arginine of the octapeptide does. 

It is ironic that the bacteria that Schroeder 
et al. find to be susceptible to reduced hBD-1 
belong to either the lactobacilli or bifido- 
bacteria genera — organisms that are generally 
considered’ to be health-promoting probiotics 
rather than potential pathogens. This observa- 
tion, however, should be considered a proof 
of concept, rather than a serious colonic 
conundrum. 

At least 1,000 species of bacteria reside in the 
colon of a healthy adult, and a single gram of 
faeces may contain up to 10” bacteria. Unlike 
bifidobacteria, most bacterial species residing 
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a Oxidized hBD-1 


b Reduced hBD-1 


Figure 1 | Computer-generated structures of human 6-defensin 1. a, In its oxidized form, the peptide 
human £-defensin 1 (hBD-1) contains three disulphide bonds. The peptide backbone is shown in green 
except for cysteine residues (yellow) and the six non-cysteine carboxy-terminal residues (pink). The broad 
arrows represent }-sheet components found only in the oxidized form. b, The reduced hBD-1 structure 
was generated from the lowest-energy conformation of oxidized hBD-1, after breaking its disulphide 
bonds and causing it to assume a random conformation. (Structure generated by Alan J. Waring, 


Univ. California, Los Angeles.) 


in the colon cannot be grown in culture, and 
their presence can only be disclosed using 
various ‘gene-sniffing’ techniques. It could be, 
therefore, that the effects of reduced hBD-1 on 
probiotic bacteria are simply collateral damage 
on these harmless bystanders by a defence 
system that also targets less-well-intentioned 
intestinal residents or transients. Alternatively, 
even probiotics may require surveillance to keep 
them from overstepping their boundaries. 
Ideally, the activity of an antibiotic should be 
examined in a defined medium, the composi- 
tion of which closely resembles, or precisely 
replicates, the in vivo environment. With pre- 
cise simulation of the colonic content being 
too challenging to contemplate, it would be 
informative to learn how defined factors — 
such as pH and, in particular, salinity — affect 


positive charge of these four peptides varies 
from +4 for hBD-1 to an astounding +11 for 
hBD-3, whose eight carboxy-terminal residues 
alone carry a net charge of +6. From previous 
work" on hBD-3, its high net positive charge 
contributes substantially to the peptide’s ability 
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to kill bacteria or fungi such as Candida albi- 
cans. This is especially true when the assays are 
performed in media of low ionic strength. 

Given the extreme cationicity and high 
intrinsic activity of oxidized hBD-3, it is not 
surprising that when Schroeder et al.* removed 
its disulphide bonds, they did not detect 
improved activity of this peptide against 
bifidobacteria. After all, a bacterium can only 
be killed once. For thioredoxin reductase to 
empower hBD-3 to do so twice would be a 
reductio ad absurdum. = 
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Catalytic detoxification 


Protein engineering of an enzyme that catalytically detoxifies organophosphate 
compounds in the body opens up fresh opportunities in the search for therapeutic 
protection against nerve agents used in chemical warfare. 


the activity of reduced hBD-1 in vitro. Another 
interesting experiment would be to test the 
antibacterial activity of mixtures of reduced 
and oxidized hBD-1, because clearly such 
mixtures occur in vivo. 

It remains unknown whether Schroeder 
and colleagues’ results are unique to hBD-1 or 
whether they are also true for other defensin 
peptides. Defensins and defensin-like peptides 
are fairly universal participants in host defence 
against infection’: they occur in plants, fungi, 
invertebrates and vertebrates. Vertebrates have 
three subfamilies of defensins (designated a, B 
and 6)*, the members of which consist exclu- 
sively of cationic peptides with six cysteines 
and three disulphide bonds, which also provide 
resistance to premature proteolytic digestion. 

Although more than 20 genes have been 
identified’ that encode hBDs, only hBDs 1-4 
have received extensive attention. The net 
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rganophosphates are among the most 

toxic compounds that have been chem- 

ically synthesized. Since the discovery 
of their biological activity in the 1930s, these 
compounds have found use as broad-spectrum 
insecticides for agricultural and domestic appli- 
cations. But organophosphates have also been 
developed as chemical-warfare agents, includ- 
ing VX and the ‘G-agents’ (suchas sarin, soman 
and cyclosarin). Because these compounds are 
relatively easy to synthesize, their use by inter- 
national terrorist groups is a serious threat. 
Current protocols for the prevention and treat- 
ment of organophosphate poisoning are largely 
ineffective, and so new strategies are desper- 
ately needed. Reporting in Nature Chemical 
Biology, Gupta et al.’ describe an approach that 


310 | NATURE | VOL 469 | 20 JANUARY 2011 


© 2011 Macmillan Publishers Limited. All rights reserved 


might one day find use in preventing organo- 
phosphate poisoning. 

Organophosphates are highly toxic because 
they rapidly inactivate acetylcholinesterase 
(AChE), an enzyme required for nerve func- 
tion (Fig. 1). AChE breaks down (hydrolyses) 
acetylcholine, a neurotransmitter that relays 
nerve impulses to muscles and other organs. 
Organophosphates form a covalent bond toa 
serine amino-acid residue in the active site of 
AChE, stopping the enzyme from function- 
ing. The subsequent build-up of acetylcholine 
blocks cholinergic nerve impulses, leading to 
paralysis, suffocation and death. 

Various prophylactic approaches have 
been developed to diminish the toxic effect 
of organophosphates. Atropine, for example, 
is a competitive antagonist for muscarinic 
acetylcholine receptors — it blocks the action 


of acetylcholine, thereby reducing the effec- 
tive concentration of the neurotransmitter. 
Alternatively, chemicals such as pralidoxime 
react with AChE-organophosphate adducts to 
regenerate catalytically active AChE. 

A relatively new approach for reducing the 
concentration of organophosphates in the 
blood is to inject human butyrylcholinesterase 
(BChE) directly into the bloodstream of a 
poisoned individual’. This enzyme reacts with 
organophosphates in the same way as AChE, and 
thus acts as a selective scavenger for the nerve 
agents. But a problem with this approach is that 
the scavenging reaction is stoichiometric — 
one BChE molecule is required to scavenge one 
molecule of organophosphate. This means that 
a substantial amount of the enzyme must be 
injected into the body to reduce a lethal dose to 
non-toxic levels: approximately 350 milligrams 
of BChE are required to detoxify every milli- 
gram of cyclosarin, for example, because the 
molecular mass of the enzyme is much greater 
than that of the nerve agent. 

Gupta et al.' now report genetically modi- 
fied enzymes that hydrolyse organophos- 
phates. Notably, the nerve agents bind to 
these enzymes as substrates (non-covalently 
and reversibly), rather than as potent inacti- 
vators (which bind covalently and irrevers- 
ibly). The enzymes therefore behave as 
catalysts for organophosphate clearance — 
each enzyme molecule destroys thousands 
of molecules of a nerve agent, thus reducing 
the amount of enzyme required to detoxify a 
lethal dose. 

The authors’ work builds on an earlier study’ 
in which the active site of a bacterial phospho- 
triesterase enzyme was optimized by protein 
engineering to effectively catalyse the hydrolysis 
of a wide range of organophosphates, including 
sarin, soman and cyclosarin. Expression of this 
enzyme in caterpillars reduced the lethal effects 
of paraoxon*, an insecticide whose active form 
is an organophosphate. It is unlikely, however, 
that a protein of bacterial origin could be used 
as an effective therapeutic agent for organo- 
phosphate toxicity in humans. 

Gupta et al.' have made great strides towards 
solving this problem by using an enzyme 
largely of human origin. They worked with 
a serum paraoxonase enzyme, PON1, which 
catalyses the hydrolysis of lactones (cyclic 
molecules often found in nature), but which 
also catalyses the slow hydrolysis of a range of 
organophosphate nerve agents. Unfortunately, 
PON isn't stable enough to be expressed and 
manipulated in bacterial cells, as is required 
for engineering its catalytic properties. The 
authors partly overcame this challenge by 
using a hybrid enzyme’ that combined parts 
of human and rabbit PON1 enzymes. 

If an enzyme is to be an effective catalyst 
for the hydrolysis of organophosphate nerve 
agents, the rate constant (k,,,/K,,, a measure 
of the speed of a chemical reaction) for the 
process must exceed 10’ M™ min". Another 
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Figure 1 | Enzyme reactions of organophosphate nerve agents. a, In vivo, organophosphate nerve 

agents such as cyclosarin react to form a covalent bond with a serine amino-acid residue in the active site of 
acetylcholinesterase (AChE). This inactivates AChE, preventing it from hydrolysing the neurotransmitter 
acetylcholine, leading to suffocation and ultimately death. Butyrylcholinesterase (BChE) reacts with 
organophosphates in the same way, and can be injected into the bloodstream to scavenge the nerve agents, 
reducing their concentration to non-toxic levels. Because BChE is a stoichiometric scavenger, large 
concentrations are required for it to be therapeutically effective. b, Gupta et al.’ have engineered serum 
paraoxonase (PON1) to catalyse the rapid hydrolysis of organophosphates. One PON] molecule catalyses 
thousands of hydrolysis reactions, and so the amount of enzyme required to reduce the concentration of 
organophosphates to non-toxic levels is much lower than that needed for BChE. 


consideration to take into account is that the 
G-agents are chiral molecules — they form 
isomers known as enantiomers that are mir- 
ror images of each other. Only one of the two 
enantiomers (the S, enantiomer) is toxic. Frus- 
tratingly, Gupta and colleagues found that their 
hybrid PON1 enzyme primarily hydrolysed 
the non-toxic isomer (the R, enantiomer) of 
an analogue of cyclosarin, and that k.../K,, for 
the hydrolysis of the S, enantiomer was less 
than 200 M7 min. 

The authors therefore subjected the hybrid 
PON1 toa series of directed-evolution and 
rational-design experiments, in which altera- 
tions were made to the amino-acid sequence 
of the protein and the resulting mutants were 
screened to assess their effectiveness in hydro- 
lysing organophosphates. To identify the most 
effective mutants, the researchers developed 
a sorting procedure that compartmentalized 
individual bacteria expressing the mutant 
enzymes in emulsion droplets. By adding a sub- 
strate to the droplets that produces a fluorescent 
compound when hydrolysed by the enzymes, 
those bacteria expressing active mutants were 
easily detected and isolated. The most active 
mutants were then subjected to further rounds 
of alterations and screening. After several gen- 
erations of enzymes had been produced, the 
authors identified a mutant for which k.,,/K,, 
for the hydrolysis of the toxic isomer of a 
cyclosarin analogue exceeded 10’ M' min‘. 
This represents an enhancement of 100,000 
over the activity of the starting enzyme. 

The authors went on to show that the newly 
evolved enzyme acts as a prophylactic against 
G-agent exposure in mice. When they injected 
the animals with the recombinant enzyme at a 
dose of 2.2 milligrams per kilogram of animal 
body weight one hour before giving them a 
lethal dose ofa cyclosarin analogue, 75% of the 
mice were still alive after 24 hours. By contrast, 


none of the control mice survived. However, 
when mice were challenged with a lethal dose 
of the organophosphate six hours after injection 
of the recombinant enzyme, the survival rate 
dropped to 50%. No protection was afforded 
when the mice were challenged after 24 hours. 

Gupta and colleagues’ study nicely demon- 
strates that a predominantly human enzyme 
can be engineered to provide significant 
therapeutic protection against lethal exposure 
to nerve agents. The catalytic detoxification 
of organophosphate nerve agents is clearly 
superior to the stoichiometric detoxification 
currently afforded by human BChE, because 
much lower doses of the modified PON1 
enzyme are needed for a therapeutic effect. But 
for the authors’ approach to be truly practical, 
the duration of the catalytic scavenger’s activity 
in humans must be increased beyond the few 
hours that the current experiments indicate. In 
addition, the substrate specificity and catalytic 
efficiencies of scavengers must be expanded and 
enhanced for activity against other G-agents 
and against the even more lethal VX. One might 
also envisage the development of prophylactics 
to protect people from exposure to agricultural 
organophosphate pesticides, which have caused 
far more medical problems over time than the 
use of military nerve agents. m 
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to cover timely and important developments 

in the broader field of biology, ranging from 
the subcellular to the organismal level, and including 
molecular mechanisms and biomedicine. The reviews 
in this Insight discuss in turn specific aspects of the 
aetiology of cancer, the response to infection and 
inflammatory disease, cardiovascular development and 
disease, and epigenetic mechanisms. 

In the first review, Jane Visvader considers our current 
knowledge of the nature of the cells that give rise to 
cancer — the cells of origin. Tumours are heterogeneous 
in that different tumours within one organ can have 
different phenotypes, and individual tumours are made 
up of phenotypic and functionally heterogeneous cancer 
cells. The cellular origins of cancer are the subject of 
considerable debate as their influence can explain at 
least part of this heterogeneity. The article discusses key 
questions and concepts, as well as the implications for 323 
early cancer diagnosis and prevention. 

The review by Beth Levine, Noboru Mizushima and 
Herbert Virgin covers new developments at the interface 
between autophagy and immunology. Autophagy is a 
protective cellular process in which proteins, organelles 
and pathogens are sequestered in a double-membrane 
structure before delivery to the lysosome for degradation. 
Defects in the function of the autophagy pathway, and 
also impaired autophagy function that is independent 
of degradation, have been linked to the pathogenesis of 
infectious diseases and inflammatory syndromes. 

Eric Small and Eric Olson highlight recent progress 
in our understanding of the role of microRNAs 336 
in regulating cardiovascular function and disease 
mechanisms, and discuss the potential opportunities for 
new microRNA-based diagnostics and therapeutics. 343 

The final review by Raphaél Margueron and Danny 
Reinberg provides an update on our knowledge about 
the histone methyltransferase Polycomb repressive 
complex 2 (PRC2) in regulating chromatin and gene 
expression, and its effects on development and cancer. 

We hope that you will find the articles informative and 
stimulating. 

Alex Eccleston, Barbara Marte, Deepa Nath 
& Clare Thomas 
Senior Editors 
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Cells of origin in cancer 


Jane E. Visvader?? 


Both solid tumours and leukaemias show considerable histological and functional heterogeneity. It is widely accepted 
that genetic lesions have a major role in determining tumour phenotype, but evidence is also accumulating that cancers 
of distinct subtypes within an organ may derive from different ‘cells of origin’. These cells acquire the first genetic hit or 
hits that culminate in the initiation of cancer. The identification of these crucial target cell populations may allow earlier 
detection of malignancies and better prediction of tumour behaviour, and ultimately may lead to preventive therapies 


for individuals at high risk of developing cancer. 


dates back to 1600 Bc in Egypt, where the papyrus described eight 

cases of tumours occurring on the breast and their treatment by 
cauterization. Today, cancer is a leading cause of death worldwide, with 
the number of deaths from cancer projected to increase markedly, owing 
partly to an ageing global population. This immense cancer burden 
demands strategies that permit earlier detection, better stratification of 
tumours to guide therapy and the development of effective preventive 
therapies. Targeted therapeutic strategies that suppress the progression 
of preneoplastic cells towards the malignant state hold great promise 
for circumventing the huge challenges associated with the treatment of 
late-stage disease. 

Tumours show marked heterogeneity in their cellular morphology, 
proliferative index, genetic lesions and therapeutic response. The 
molecular and cellular mechanisms underpinning tumour hetero- 
geneity remain central questions in the cancer biology field. Key 
issues include whether the different subtypes of cancer reflect a dis- 
tinct ‘cell of origin, the extent to which the genetic mutational profile 
contributes to tumour phenotype and the nature of the relationship 
between the cell of origin and the cancer stem cell. This Review 
focuses on the strategies used to identify cells of origin, the impact 
of these cells on cancer cell fate and behaviour, and the implications 
for the development of improved prognostic markers and preven- 
tive therapies. 


Rittsrct the oldest description and surgical treatment of cancer 


Cell-of-origin and cancer stem-cell concepts are distinct 

It is important to note that the cell of origin, the normal cell that 
acquires the first cancer-promoting mutation(s), is not necessar- 
ily related to the cancer stem cell (CSC), the cellular subset within 
the tumour that uniquely sustains malignant growth. That is, the 
cell-of-origin and CSC concepts refer to cancer-initiating cells and 
cancer-propagating cells, respectively (Fig. 1). Although the tumour- 
initiating cell and the CSC have been used interchangeably, the 
tumour-initiating cell more aptly denotes the cell of origin. There 
is considerable evidence that several diverse cancers, both leukae- 
mias and solid tumours, are hierarchically organized and sustained 
by a subpopulation of self-renewing cells that can generate the full 
repertoire of tumour cells (both tumorigenic and non-tumorigenic 
cells)’. The cell of origin, the nature of the mutations acquired, and/ 
or the differentiation potential of the cancer cells are likely to deter- 
mine whether a cancer follows a CSC model. In most instances, the 
phenotype of the cell of origin may differ substantially from that of 
the CSC. 


Tumour heterogeneity 

Phenotypic and functional heterogeneity are hallmarks of cancers aris- 
ing in several organs. Variability can occur between tumours arising in 
the same organ (intertumoral heterogeneity), leading to the classifica- 
tion of discrete tumour subtypes. These subtypes are typically charac- 
terized by their molecular profile, together with their morphology and 
expression of specific markers (such as hormone and growth-factor 
receptors). Variation also occurs within individual tumours (intra- 
tumoral heterogeneity), in which the tumour cells often have a range 
of functional properties and a diverse expression of markers. For exam- 
ple, the proportion of cells that express the oestrogen receptor within 
a patient’s breast tumour can vary extensively, from 1% to 100%. The 
CSC and clonal-evolution models have been put forward to account 
for intratumoral heterogeneity and intrinsic differences in tumour- 
regenerating capacity (reviewed in refs 1 and 2). Interestingly, despite 
the heterogeneous nature of tumours, the histopathology and gene- 
expression profiles of tumours arising in patients often remain relatively 
stable during progression from localized disease to metastatic and even 
end-stage disease**. 

Two main mechanisms have been conceptualized to explain 
intertumoral heterogeneity: different genetic or epigenetic muta- 
tions occurring within the same target cell result in different tumour 
phenotypes (Fig. 2a), and different tumour subtypes arise from 
distinct cells within the tissue that serve as the cell of origin (Fig. 2b). 
It is important to note that these cellular and molecular mechanisms 
are not mutually exclusive, but can act together to determine tumour 
histopathology and behaviour. In addition, extrinsic mechanisms may 
be involved in generating tumour heterogeneity, because interactions 
between tumour cells and the stromal micro-environment are a crucial 
determinant of malignant growth’. Several studies on human cancers 
and mouse models have highlighted the importance of specific genetic 
aberrations in contributing to tumour behaviour. Many oncogenes 
and tumour-suppressor proteins, most prominently phosphatidyl- 
inositol-3-OH kinase (PI(3)K), MYC, RAS, p53, PTEN, ple™* and 
retinoblastoma protein (RB), are frequent culprits in diverse cancers, 
although the overall mutational profiles of different cancer types can 
vary considerably. Tumour maintenance undoubtedly depends on the 
continued expression of certain oncogenes — a phenomenon known 
as oncogene addiction’. Lineage-dependency oncogenes that have key 
survival roles, in which genetic changes may be predetermined by the 
lineage programs inherent in the tumour precursor cell’, are also likely 
to contribute. There is mounting evidence, however, that the nature of 
the cellular target has an important influence on tumour cell fate and 
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Figure 1 | The cell of origin and evolution of a cancer stem cell. Normal 
cellular hierarchy comprising stem cells that progressively generate common 
and more restricted progenitor cells, yielding all the mature cell types that 
constitute a particular tissue. Although the cell of origin for a particular tumour 
could be an early precursor cell such as a common progenitor, the accumulation 


pathology. Indeed, activation of the same oncogenic pathway in differ- 
ent cellular compartments or contexts may profoundly influence malig- 
nant potential®. For example, transgenic mouse models have shown 
that mutant Hras targeted to the hair follicle region highly predisposed 
mice to squamous carcinomas, whereas its targeting to more differ- 
entiated interfollicular or suprabasal cells resulted in papillomas with 
low malignant potential’”®. Moreover, transformation of distinct breast 
epithelial cells in vitro has indicated that the target cell is an important 
determinant of tumour phenotype”. 

Understanding the normal cellular hierarchy within a given tissue 
is an important prerequisite to identifying the cells of origin of can- 
cers. Organ development proceeds in a hierarchical manner from stem 
cells to committed progenitor cells, which in turn yield differentiated 
cells that constitute the bulk of the tissue or organ (Fig. 1). The most 
primitive cells, stem cells, have been favoured candidates for targets 
of transformation because of their inherent capacity for self-renewal 
and their longevity, which would allow the sequential accumulation of 
genetic or epigenetic mutations required for oncogenesis. Nevertheless, 
any cell in the hierarchy with proliferative capacity could serve asa cell 
of origin in cancer, if it acquires mutations that re-instigate self-renewal 
capacity and prevent differentiation to a post-mitotic state. 

The normal lineage hierarchy can serve as a framework to probe 
potential targets of carcinogenesis by comparison of lineage markers 
expressed on the surface of normal and neoplastic cell subsets. More 
accurate correlations, however, depend on comparisons of the expres- 
sion signatures of normal cell populations with those of the different 
tumour subtypes arising within that organ. Notably, histologically indis- 
tinguishable glial-cell tumours from different parts of the central nerv- 
ous system have distinct molecular gene signatures and chromosomal 
abnormalities, suggesting that they originated in different subpopula- 
tions of site-restricted progenitor cells'*”*. In a recent integrated genom- 
ics approach to studying tumour heterogeneity, the transcriptomes of 
human brain tumours were matched to those of mouse neural stem cells 
(NSCs) from different cellular compartments within the central nervous 
system. Embryonic cerebral NSCs and adult spinal NSCs were revealed 
as the potential cells of origin for supratentorial and spinal ependymo- 
mas, respectively’*. In breast cancer, the different molecular subtypes'”’* 
have also been linked to normal epithelial subpopulations by the inter- 
rogation of specific gene-expression signatures’”. Such observations 
remain correlative, however, until the tumorigenic potential of specific 
cells is proven in vivo by clonality or lineage-tracing studies. Although 
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of further epigenetic mutations by a cell within the aberrant population (in this 
case expanded) during neoplastic progression may result in the emergence of 
a CSC. In this model, only the CSCs (and not other tumour cells) are capable 
of sustaining tumorigenesis. Thus, the cell of origin, in which tumorigenesis is 
initiated, may be distinct from the CSC, which propagates the tumour. 


the hierarchy provides an important framework for understanding cells 
of origin in cancer, if tumour cells show phenotypic plasticity or ded- 
ifferentiate during neoplastic progression, then lineage markers and 
molecular signatures of tumour cells may not precisely reflect the true 
cell of origin in normal tissue. 


Strategies to investigate the cellular origins of cancers 
Genetically engineered mouse models have proven indispensable 
in addressing the cellular origin of cancers (Fig. 3). Two primary 
approaches have been used to tackle this question: one, transgenic 
or conditionally targeted gene technologies to explore the effects 
of oncogenes and tumour suppressors in different cellular con- 
texts; and, two, genetic alteration of cells ex vivo before evaluating 
their tumorigenic capacity in mice. The first approach requires 
cell-specific promoters that direct expression of an oncogene, or 
Cre-mediated deletion of a tumour-suppressor gene, in a specific 
subset of cells in vivo (Fig. 3a). Ideally, such studies should use at 
least two promoters with different cell-type specificity to reveal the 
tumorigenic susceptibility of distinct cell subpopulations within 
that tissue. In this model, targeting of only one cell subpopula- 
tion is expected to reveal tumours that recapitulate the phenotype 
of the human cancer being modelled. Although this approach has 
been increasingly used to study cells of origin, particularly in brain 
tumours, it is often hampered by a lack of established cell-lineage- 
specific promoters, given that unique markers of stem and progeni- 
tor cells do not exist for the overwhelming majority of organs and 
tissues. 

A further refinement of this in vivo targeting approach involves 
lineage tracing of cells as they undergo transformation. In this system, 
the main oncogenic event is activated conditionally in a limited number 
of cells rather than simultaneously in all cells that express the promoter. 
For example, a tamoxifen-inducible Cre recombinase-oestrogen recep- 
tor fusion protein (CreER)'*"’ driven by a cell-type-specific promoter 
allows inducible gene expression, in which the dose and number of 
pulses can be fine-tuned to ensure single-cell tracking. Lineage tracing 
at the clonal level is the current ‘gold standard’ for delineating the target 
cell of transformation in mouse models (Table 1). 

In the second approach, defined cell subpopulations are genetically 
manipulated ex vivo and subsequently transplanted orthotopically into 
mice to assess their predisposition to tumour initiation (Fig. 3b). The 
strategy is applicable to cells from both human and mouse tissues, and 
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Figure 2 | Two models of intertumoral heterogeneity. a, In the genetic (and 
epigenetic) mutation model, mutations primarily determine the phenotype 
of the tumour, such that different mutations result in different tumour 
morphology. b, In the cell-of-origin model, different cell populations in the 
lineage hierarchy serve as cells of origin for the different cancer subtypes 
arising within that organ or tissue. 
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relies on the reproducible sorting of functionally defined populations 
that can serve as targets for the introduction of relevant oncogenic 
lesions. This approach has been widely exploited to identify potential 
cells of origin in human leukaemias, many of which contain character- 
istic chromosomal translocations. 

An alternative way of exploring early cellular changes that occur 
before the onset of overt disease is to dissect the cellular components 
of preneoplastic tissue from individuals in families at high risk of 
cancer. These include carriers of germline mutations in the adenom- 
atous polyposis coli (APC) gene, hereditary non-polyposis colorectal 
cancer (HNPCC) genes (such as MSH2 and MLH1), and BRCA1 
or BRCA2 genes. Carriers of mutated APC and HNPCC genes are 
predisposed to developing colorectal cancer”; female BRCA1- or 
BRCA2-mutation carriers are prone to breast and ovarian cancer”'; 
and male BRCA2-mutation carriers often develop prostate cancer. 
This strategy has proven insightful in the case of BRCA1-mutation 
carriers (see ‘Histopathology does not necessarily reflect cell of ori- 
gin’). Combined with transplantation and clonality studies, cell sub- 
sets predisposed to neoplastic progression can thus be identified. 
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Cells of origin in haematopoietic malignancies 

In different leukaemias, both normal stem and committed progeni- 
tor cells have been implicated as cellular targets of transformation. In 
chronic myeloid leukaemia (CML) — one of the first disorders to be 
defined by a dominant genetic mutation — the long-term haemat- 
opoietic stem cell (HSC) containing the BCR-ABL mutation has been 
established as the cell of origin by in vivo clonality studies in humans”. 
Although the HSC maintains the chronic phase of the disease, analysis 
of samples from patients in blast crisis — the acute and advanced stage 
of disease — has indicated that subsequent genetic events occurring in 
downstream progenitor cells give rise to leukaemia stem cells, high- 
lighting the dynamic state of the tumorigenesis process”. The cells of 
origin for acute leukaemias, including myeloid, lymphoid and mixed- 
lineage, have not been definitively established. Human acute myeloid 
leukaemia (AML) may originate within the primitive haematopoietic 
cell compartment, on the basis of the similar cell-surface phenotypes of 
the leukaemia-initiating cell and the HSC, as well as lentivirus-mediated 
clonal-tracking studies”. A primitive human haematopoietic cell may 
also be the primary target of MLL fusion genes*”*. Moreover, in vivo 
evidence has implicated a human HSC-like cell as the initiating cell in 
a case of childhood leukaemia arising in utero”. 

Several studies have addressed potential cells of origin in mouse leu- 
kaemia models by transducing primary haematopoietic cell populations 
with oncogenes before transplantation, but these have yielded variable 
results. For mouse models of CML, only BCR-ABL targeted to HSCs, but 
not to committed progenitor cells, induced myeloproliferative disease”, 
consistent with findings for human CML. Interestingly, the MLL-GAS7 
fusion protein produced mixed lymphoid leukaemia when transduced 
into HSCs or multipotential progenitor cells but not when introduced 
into lineage-restricted progenitors. However, the MOZ-TIF2 (ref. 28), 
MLL-AF9 (ref. 29) and MLL-ENL*™” fusion proteins all initiated AML 
irrespective of the cell subtype transduced. Although HSCs generally 
appeared more susceptible to transformation than committed progeni- 
tors, a self-renewal program seemed to be reactivated in the latter cells 
during leukaemogenesis. In a ‘knock-in’ mouse model of MLL-AF9, 
only HSCs that expressed high levels of the fusion product and not the 
granulocyte-macrophage progenitors were transformed, but the latter 
could be efficiently transformed by a higher dose of MLL-AF9 after 
retroviral transduction”. Thus, oncogene dosage affects transformation 
susceptibility, emphasizing the importance of using models that permit 
oncogene expression at levels relevant to human disease. 

Further evidence that cancer can be initiated in cells other than stem 
cells has emerged from cell-fate mapping studies in transgenic mice 
overexpressing Lmo2: preleukaemic T-cell progenitors that had acquired 
self-renewal potential were identified as the cell of origin for T-cell acute 
lymphoblastic leukaemia (T-ALL) in this model”. Pertinently, mice 
lacking three pathways commonly repressed in cancer (p53, p16" 
and p19") contain cells that phenotypically resemble haematopoietic 
multipotential progenitor cells but have long-term reconstituting ability, 
indicating that they have acquired self-renewal capacity™. 


Cells of origin in solid tumours 

Evidence is increasing that either stem or progenitor cells can act as tar- 
gets for tumour initiation in a range of solid tumours (Table 1). Lineage- 
tracing studies (shown schematically in Fig. 4a) have identified probable 
cells of origin of intestinal, prostate and basal cell carcinomas, as well as 
pancreatic ductal adenocarcinoma, in mouse genetic models. Several 
other reports have used cell-specific promoters to drive Cre-mediated 
expression of the oncogenic event(s) in different cellular compartments 
of the mouse, whereas genetic manipulation of discrete cellular subsets 
has provided valuable insight into cell types prone to the initiation of 
carcinogenesis. It is crucial to note that although many studies have 
clearly identified the lineage in which the cancer originates, the precise 
cell type in the hierarchy (the cell of origin) in which transformation 
occurs remains elusive in most cases. Nevertheless, in mouse mod- 
els of intestinal and prostate tumours, it seems clear that the cancers 
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Figure 3 | Strategies used to identify cells of origin in cancer. a, 
Genetic mouse models can be used to either activate an oncogene or 
inactivate a tumour-suppressor gene in a discrete subpopulation of cells 
using cell-type-specific promoters. Comparative mouse models in which 
different promoters (A or B) drive expression of the same oncogenic 
lesion (either oncogene activation or tumour-suppressor inactivation) in 
the brain. Initiation of a glioblastoma is observed in the case of promoter 
A and a medulloblastoma in the case of promoter B, where promoters 


originate in a bona fide stem cell that is capable of self-renewal and 
multilineage differentiation. Although the lineage in which the can- 
cer originates has been revealed for skin, pancreatic, brain and breast 
tumours, the precise cells of origin have yet to be defined. 

Two distinct crypt stem cells have been identified as the cells of origin 
for intestinal cancers. The vast majority of colorectal cancer is caused 
by mutations in the WNT signalling pathway, including loss of the 
negative regulator APC and activating mutations in b-catenin, both 
of which result in constitutive WNT activation”. Apc deletion in long- 
lived stem cells (LGR5*) but not in short-lived transit-amplifying cells 
(using AhCre) revealed stem cells as a cell of origin for intestinal cancer 
in mice (Fig. 4b)”*. This target cell is also marked by CD133 (also known 
as PROM] or prominin 1)”. A novel intestinal stem cell located in the 
+4 or +5 position from the base of the crypt and therefore distinct from 
the LGR5* stem cell was also shown to be susceptible to tumorigenesis 
by deregulated WNT signalling using a BMI1-CreER knock-in model*. 
LGRS* stem cells are likely to be the target population for WNT-driven 
tumorigenesis in the stomach, where these cells seeded small adeno- 
mas”. The intestinal tumour load, however, precluded further lineage 
tracing of stem cells during the development of stomach cancer. 

The cell of origin for brain cancers has been investigated using sev- 
eral mouse genetic models that have differed in design and the nature 
of the initiating oncogenic lesions. The models have predominantly 
included conditionally targeted mice, the RCAS-TVA system — in 
which gene transfer is mediated by an oncogene-carrying RCAS ret- 
rovirus to somatic cells in TVA transgenic mice’ — and cell-culture- 
based analyses. Stereotactic injection of viruses into different areas 
of the brain*’” has also been used for the introduction of oncogenes 
or Cre recombinase to mimic focal tumorigenesis, but this approach 
cannot be used to identify cells of origin of cancer unequivocally 
as the transduced cell types are unknown. Although the available 
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A and Bare active in different cell subpopulations within the brain. b, 
Potential cells of origin in cancer can be addressed through the sorting 

of defined subpopulations from human or mouse tissue and their genetic 
manipulation ex vivo. Cell subpopulations are first transduced with genes 
encoding the oncogenic lesion(s), together with a fluorescent marker, then 
transplanted orthotopically into immunocompromised mice to evaluate 
the tumorigenic potency of the different subpopulations. GFP, green 
fluorescent protein. 


evidence argues for stem or multipotential neural progenitors in the 
subventricular zone (SVZ) as the primary cellular target for gliob- 
lastoma development, the cell of origin remains elusive owing to the 
complexity of this zone”. Many studies on brain tumorigenesis have 
used the nestin (Nes) or Gfap promoter regions to direct expression 
or inactivation: it is important to note that although both promoters 
drive expression in neural precursor cells, the Gfap promoter is also 
active in mature astrocytes”’. Nonetheless, nestin-positive precursors 
were more susceptible to transformation by RAS and AKT than the 
GFAP-positive population, and produced high-grade glioblastomas, 
consistent with tumours originating in the stem/progenitor popula- 
tion*’. Moreover, neural precursor cells in the SVZ of the adult brain 
efficiently initiated glioblastomas after conditional inactivation of p53 
(also known as Trp53), Pten and/or Nfl tumour-suppressor genes”, 
and presymptomatic mice exhibited a premalignant cell population. By 
contrast, the more differentiated cell types in non-neurogenic areas of 
the adult brain proved less susceptible to malignant transformation”. 
Similarly, mice deficient in varying combinations of p53, Pten and/ 
or Rb (also known as pRb and Rb1) developed tumours only in the 
SVZ and not from mature peripheral astrocytes™. Interestingly, the 
same stem/progenitor population seemed to initiate either gliomas 
or medulloblastomas, depending on the nature of the genetic lesions. 
More restricted progenitor cells may also initiate glioma development. 
Single-cell tracking studies of cells expressing mutant p53 implicate 
transit-amplifying cells in the SVZ”. Although oligodendrocyte pro- 
genitors and cells within the astrocyte compartment may also have the 
potential to seed glioblastomas**”, culturing cells before manipulation 
may not accurately reflect the in vivo situation, and the presence of 
more primitive cells within the cell cultures cannot be excluded. Thus, 
definitive evidence that mature astrocytes can serve as cells of origin 
for brain tumours awaits further experimentation. 
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Table 1 | Cells of origin (proven and candidate) identified in solid tumours by targeting distinct cellular subsets 


Tumour type Genetic model Promoter-Cre construct Lineage Cell of origin 
tracing 
Mouse models 
Brain: Glioblastoma RAS, AKT activation (RCAS-TVA system: A = eural progenitor cell*° 
nestin, Gfap promoters) 
p16'"**2/p194", BMI1 inactivation; A = eural progenitor and astrocyte*®*” 
mutant EGFR 
p53, NF1 and/or PTEN inactivation estin-CreERT2, Adeno-Cre = ultipotent progenitor 
PDGFB activation (RCAS-TVA system) A = Oligodendrocyte progenitor® 
RAS, AKT activation; p53 inactivation GFAP-Cre = ultipotent progenitor“? 
Mutant p53 expression GFAP-Cre = eural progenitor or transit-amplifying cell*® 
PTEN, p53 inactivation GFAP-Cre = ultipotent progenitor™ 
Medulloblastoma Patched inactivation MATH1-Cre, GFAP-Cre = ultipotent progenitor and granule neuron 
progenitor’? 
Smoothened activation GFAP-Cre, MATH1-Cre, OLIG2- = ultipotent progenitor and granule neuron 
TVA-Cre progenitor*® 
RB, p53, PTEN inactivation GFAP-Cre - ultipotent progenitor 
RB, p53 inactivation Adeno-Cre = eural progenitor cell®° 
B-catenin mutant, p53 inactivation BLBP-Cre, ATOH1-Cre = Dorsal brainstem progenitor®! 
Ependymoma p16'"*2/p194" inactivation; EPHB2 NA = Embryonic cerebral stem/progenitor cell'* 
(supratentorial) activation 
Intestine APC inactivation AhCre, LGR5-CreERT2 + Stem cell®® 
Mutant B-catenin CD133-CreERT2 + Stem cell” 
Mutant B-catenin BMI1-CreER + Stem cell*® 
Lung Kras activation Adeno-Cre = Bronchioalveolar stem cell”” 
Mammary NOTCH1 activation in cell subsets A “ Luminal progenitor® 
BRCA1, p53 inactivation BLG-Cre, K14—Cre = Luminal progenitor® 
Pancreas Kras activation, inflammation RIP-CreER + Endocrine cell® 
Prostate PTEN inactivation KX3.1-CreERT2 + Luminal stem cell®4 
ERG1, Pl(3)K and/or AR expression A = Basal progenitor®? 
PTEN inactivation PB-Cre = Basal progenitor®® 
PTEN inactivation PSA-Cre = Luminal cell°°°” 
Skin/basal cell Smoothened activation K14—CreER + nterfollicular epidermal progenitor’? 
carcinoma 
Stomach APC inactivation LGR5-CreERT2 + Stem cell°? 
Human tissue 
Breast (basal-like Preneoplastic BRCA1*” cell subsets A - Luminal progenitor?” 
subtype)* 
Prostate PI(3)K, ERG, AR into cell subsets A - Basal progenitor® 


Adeno, adenoviral; Ah, cytochrome P450 1A1 gene (also known as Cyp1a1); AR, androgen receptor; BLG, B-lactoglobulin; K14, cytokeratin 14; NA, not applicable; 
PB, probasin (prostate-specific); PSA, prostate-specific antigen; RIP, rat insulin promoter. 
*Refers to analysis of specific subsets from normal versus premalignant human breast tissue, leading to identification of a candidate cell of origin. 


Unipotent cells within the mouse brain have been identified as the 
cell of origin for medulloblastoma. Constitutive hedgehog signalling 
(due to loss of Ptc or activation of Smo) in either the stem-cell com- 
partment or granule neuron progenitor cells could initiate medullob- 
lastomas but not astrocytomas or oligodendrogliomas**”. Targeted 
deletion of a different set of genes (p53 and Rb) also supports the 
notion that cerebellar stem cells or lineage-restricted granule progeni- 
tor cells can give rise to medulloblastomas”’. Extending these observa- 
tions further, it was shown that the cells must transition to the granule 
progenitor stage for the initiation of medulloblastomas, indicating 
that the true cell of origin for medulloblastomas that exhibit hedgehog 
pathway activation” is a unipotent progenitor cell®*. A distinct cell type 
within the dorsal brainstem has recently emerged as the cell of origin 
for medulloblastomas that harbour activating mutations in the WNT 
pathway”, indicating that different subtypes of medulloblastoma have 
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distinct cellular origins. In mouse models of malignant peripheral 
nerve sheath tumours, tumours may initiate from differentiated glial 


cells in the adult brain®*”*’. 


Prostate cancer can originate from distinct cell lineages 

Prostate cancers have been widely presumed to originate from mature 
luminal cells as these cancers are characterized by an expansion of 
luminal cells and the absence of basal cells. Recent findings, however, 
implicate distinct stem cells in the basal and the luminal cellular com- 
partments, each of which can be targeted for oncogenesis by the loss 
of PTEN or PI(3)K activation. Rare luminal epithelial stem cells that 
express NKX3.1 and are castration resistant were identified™, and 
these cells could initiate high-grade prostate intraepithelial neoplasia 
(PIN) and carcinomas. It is not yet clear whether these cells, termed 
CARNs, exist within ‘normal’ mouse or human prostate (that is, the 
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non-castrated state), but these bipotent, self-renewing cells may be 
mobilized as facultative stem cells during prostate regeneration after 
androgen withdrawal. Indeed, only the basal population isolated from 
normal mouse prostate has been demonstrated to contain stem cells 
with prostate-regenerative potential”. The question of whether CARNs 
exist in patients with prostate cancer is difficult to address as castration 
is implemented only after the development of advanced disease. The 
identification of a luminal cancer-initiating cell is consistent with find- 
ings that deletion of Pten in luminal cells of the mouse prostate leads to 
prostatic hyperplasia”. 

Conversely, basal cells have been demonstrated as an efficient target 
of tumorigenesis in a Pten-deficient mouse model™ or when genetically 
manipulated ex vivo to overexpress Erg, androgen receptor (Ar) and/ 
or PI(3)K, resulting in PIN lesions and carcinomas”. The tumorigenic 
susceptibility of purified basal and luminal subpopulations from human 
prostate tissue was recently evaluated “. When the cells were transduced 
with relevant oncogenic lesions, together with a fluorescent marker, and 
transplanted into immunocompromised mice, only the basal cells could 
initiate the development of prostate cancer reminiscent of the luminal- 
like cancers that arise in humans. 


Histopathology does not necessarily reflect cell of origin 
Tumours have largely been classified on the basis of their histological 
appearance and expression of markers (such as ER and HER? in breast 
cancer) that predict the response of the tumour to a given treatment. 
However, the histological and cell-surface marker profiles of tumours 
do not necessarily predict the cell of origin, as illustrated above for pros- 
tate cancer. Other examples that underscore this point include breast, 
pancreatic and basal cell carcinomas, as discussed below. 

Individuals that harbour mutations in the BRCA1 tumour-suppres- 
sor gene develop breast cancers that usually resemble the basal-like 
subtype, typically associated with poor clinical prognosis*”’. The 
basal stem cell has therefore been presumed to be the transforma- 
tion target for this tumour subtype, but the luminal progenitor has 
instead emerged as the likely cell of origin. Analysis of cellular subsets 
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Figure 4 | Identification of crypt stem cells as 

the cell of origin in intestinal cancer by lineage 
tracing. a, Schematic depiction of lineage tracing in 
a colonic crypt, in which a single dose of tamoxifen 
can be used to activate CreER specifically in a stem 
cell to drive expression of a given oncogenic lesion in 
this cell population. The promoter that drives CreER 
expression will determine the cells in which this 
occurs. In the case of a stem cell, the incorporated 
reporter gene, such as lacZ, will mark all progeny 

of the stem cell. b, Schematic representation 
summarizing the data from ref. 36, in which lineage 
tracing of either stem or transit-amplifying cells 
deficient in Apc shows the initiation of intestinal 
tumours from stem cells only. 
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in precancerous breast tissue from BRCA I-mutation carriers dem- 
onstrated expansion of luminal progenitor cells that showed altered 
growth properties and aberrantly produced nodules when transplanted 
into mice (ref. 17 and F. Vaillant and J.E.V., unpublished data). Moreo- 
ver, there are significant similarities between the gene-expression pro- 
files of normal breast luminal progenitors, preneoplastic tissue from 
BRCA1-mutation carriers and basal-like breast cancers'’. Indeed, inac- 
tivation of Brca1 (and p53) in either luminal or basal cells of the mouse 
mammary gland showed that only the luminal cell population initiated 
basal-like cancers reminiscent of those arising in BRCA1-mutation 
carriers™. The presence of ALDH1* lobules in pathologically normal 
tissue from BRCA1-mutation carriers™ is compatible with a luminal 
cell of origin, because luminal progenitors exhibit ALDH activity (F. 
Vaillant and J.E.V., unpublished data). NOTCH1 activation also targets 
luminal progenitor cells, generating an aberrant, self-renewing progen- 
itor cell that yields mammary hyperplasia and, eventually, tumours”. 
Accordingly, high NOTCH1 levels occur in basal-like breast cancers 
and predict poor prognosis®. The cells of origin for most other breast 
cancers have yet to be defined. In particular, the role of the mammary 
stem cell in breast oncogenesis is unclear, although WNT pathway 
activation primarily targets this population”, and the ‘claudin-low’ 
subtype of breast cancer, which is characterized by low expression of 
genes involved in tight junctions and cell-cell adhesion, shares a similar 
molecular profile to that of the stem-cell subset'”®. 

Pancreatic ductal adenocarcinoma (PDAC) and premalignant duc- 
tal lesions (termed pancreatic intraepithelial neoplasia) have a ductal 
morphology, suggesting that they develop from pancreatic duct cells”. 
Unexpectedly, however, premalignant lesions were shown to derive from 
differentiated acinar cells that were reprogrammed to a duct-like pheno- 
type”. Moreover, targeting a Kras oncogenic signal to insulin-positive 
endocrine cells induced PDAC. Notably, the ductal reprogramming of 
acinar cells required inflammatory tissue damage, highlighting a role 
for non-genetic factors in contributing to tumour phenotype. Ductal 
adenocarcinoma can also arise from other pancreatic cell lineages in the 
absence of tissue injury, for example PDX1-expressing cells”. 
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Lineage-tracing studies have shown that basal cell carcinomas origi- 
nate in progenitor cells resident in the interfollicular epidermis of the 
skin rather than from stem cells as originally postulated”. Conditional 
activation of hedgehog signalling in different cellular compartments, 
combined with cell-fate mapping, showed that long-lived progenitors 
in the interfollicular epithelium, but not the hair follicle bulge stem cell 
or the transit-amplifying cells, produced tumours. The block in differ- 
entiation evident in interfollicular epidermal cell clones with constitu- 
tive hedgehog signalling correlated with the expression of basal lineage 
markers (such as P-cadherin and keratins 7 and 15) and may have led 
to the notion that basal cell carcinomas arise from hair follicle bulge 


stem cells*”?, 


Potential relationships between cells of origin and CSCs 
Although a stem cell may sustain the first oncogenic hit, subsequent 
alterations required for the genesis of a CSC can occur in descendent 
cells (Fig. 1). This is exemplified by CML, in which the HSC is the cell 
of origin in the more indolent phase of the disease but in patients with 
CML blast crisis, granulocyte-macrophage progenitors acquire self- 
renewal capacity through a B-catenin mutation and emerge as the prob- 
able CSC”. 

In some instances, particularly in early-stage cancers, the CSC may 
closely resemble the cell of origin, although this remains to be proven. 
For example, the leukaemia-initiating cell in AML” may prove to be 
the same as the leukaemia stem cell that propagates the disease. In a 
mouse model of intestinal cancer, despite all neoplastic cells arising from 
CD133* stem cells, only a small fraction of the tumour cells retained 
CD 133 expression. It is tempting to speculate a hierarchical model of 
tumour progression in which this small subset of CD133* cells might 
generate the full repertoire of tumour cells and thereby correspond 
to CSCs. This notion is compatible with the observation that CD133 
marks CSCs in certain human colorectal tumours”””*. Nevertheless, it 
remains to be determined whether these CD133* or LGRS’ cells have 
tumour-propagating ability. Bronchioalveolar stem cells (BASCs) have 
been implicated as the cell of origin for lung adenocarcinomas induced 
by mutant Kras” in mice and may be closely related to the CSC, because 
the BASC marker Scal was recently shown to identify CSCs in certain 
mouse models of non-small-cell lung cancer”. In prostate cancer, ifthe 
oncogenic transformation of CARNs leads to the formation of CSCs in 
prostate cancer, then this might explain how early events occurring in 
the cell of origin can contribute to the emergence of hormone-refractory 
disease™. Although the relationships between tumour cells of origin and 
CSCs are not well understood, comprehensive cellular analyses of the 
preneoplastic and neoplastic states of different tumour subtypes should 
eventually shed light on this issue. 


Therapeutic and diagnostic implications 

Identification of the cell of origin has important implications for new 
preventive therapeutic approaches to suppress or reverse the initial 
phase of disease. Cancer chemoprevention will be most applicable to 
individuals within families at high risk of cancer such as BRCA1/2- 
mutation carriers. Cell-surface markers or proto-oncogenic kinases 
such as c-KIT” that show altered expression in cell subsets in preneo- 
plastic tissue can be evaluated as prognostic markers, and for their abil- 
ity to eradicate or modulate aberrant cells in either preneoplastic or 
established disease. 

In principle, individuals that carry a defect in the APC gene, and 
are thus highly susceptible to colorectal cancer, could benefit from 
prophylactic treatment that targets APC-deficient cells for apoptosis. 
Tumour-necrosis-factor-related apoptosis-inducing ligand (TRAIL) in 
combination with all-trans retinoic acid selectively induced apoptosis in 
APC-deficient premalignant cells and intestinal polyps, thus inhibiting 
tumour growth”. Furthermore, treatment of biopsy samples of human 
colonic polyps from patients with familial adenomatous polyposis 
showed selective apoptosis of polyps, whereas normal tissue was unaf- 
fected, providing a potentially effective method of chemoprevention in 
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these patients. In other families at high risk of colorectal cancer, with 
mutations in the MLH1 or MSH2 mismatch repair genes, inhibition 
(short-term and intermittent) of selective DNA polymerases may bea 
potential chemopreventive strategy, as these agents have been shown 
to elicit tumour cell death in patients with HNPCC”. 

Tamoxifen and aromatase inhibitors (inhibitors of oestrogen action 
and biosynthesis, respectively) are the prototypes for chemopreven- 
tive agents in hormone-receptor-positive breast cancer, as they mark- 
edly reduce the rate of disease recurrence and more than halve the 
incidence of new cancers in patients*’. Recent findings have clari- 
fied how ovarian hormone exposure enhances breast cancer risk by 
showing that mammary stem cells, despite lacking receptors for these 
hormones, are highly responsive to steroid hormone signalling in 
vivo”. As the paracrine signals relayed to these stem cells seem to 
involve the receptor activator of NF-xB (RANK) signalling pathway, 
an exciting corollary of these findings is that it should be possible 
to prevent some forms of breast cancer by driving stem cells into a 
dormant state — for example, by blockade of the RANK pathway — 
for which inhibitors are already in clinical trial for bone metastases. 
There may also be prophylactic benefit for BRCA 1/2-mutation car- 
riers in the use of poly(ADP-ribose) polymerase (PARP) inhibitors’, 
which are being evaluated for the treatment of BRCA1/2-associated 
breast cancers”, if the early lesions in these individuals prove to be 
defective in DNA repair. 


Perspective 

It seems intuitive that both the cell of origin and the pattern of 
acquired mutations determine tumour fate and phenotype. The close 
association between cell lineage and cancer phenotype suggests that 
lineage-restricted mechanisms that normally operate during develop- 
ment may contribute to tumorigenesis. The cell of origin may often 
correspond to the normal tissue stem cell, exploiting its intrinsic self- 
renewal ability. This may particularly apply to tissues with very high 
turnover, such as the gut, because progenitor cells may not live long 
enough to acquire the full set of mutations required for malignancy. 
The stem cell or an early progenitor cell has also emerged as a likely 
cell of origin in certain leukaemias, glioblastomas and prostate cancer. 
In other malignancies, however, the initiating cell can be a restricted 
progenitor, as in the case of medulloblastomas, basal cell carcinomas 
and BRCA 1-associated breast cancer. Indeed, in cell types that retain 
high proliferative potential, such as some differentiated lymphoid cells, 
the cell of origin could even be a mature cell type. Notably, there are 
several examples indicating that tumour phenotype may not directly 
reflect tumour histology or lineage marker expression, thus highlight- 
ing the requirement for in vivo studies to assess the propensity of cell 
populations to act as cells of origin. 

Mouse models of oncogenesis have been pivotal in uncovering the 
cellular origins of cancer and the impact of specific mutations on tumor- 
igenesis. Arguably, the choice of the genetically modified mouse model 
and the promoter/enhancer to recapitulate the effects of the oncogenic 
lesion has a major influence on tumour phenotype and behaviour. More 
specific promoters to drive expression of an initiating event within a 
definitive cellular compartment are likely to evolve as the normal lineage 
hierarchies within tissues are further refined. For studies on the cell of 
origin in human tissues, genetic and cellular analyses of tumour cell 
populations, at the single-cell level, from patients at different stages of 
disease should provide substantial insight into the relationships among 
normal cells, cells of origin and CSCs. 

Identification of the cell of origin may permit a more systematic 
analysis of the genetic lesions involved in tumour initiation and pro- 
gression, and serve as a platform for the identification of early disease 
biomarkers. It may also have important implications for preventing 
relapse, particularly in cases in which relapse results from a ‘prema- 
lignant’ clone (perhaps the cell of origin itself) that persists in the 
patient before acquiring a mutation that renders it malignant. If so, 
even patients with cancer who have a profound regression may require 
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maintenance therapy to reduce the chance of relapse. Finally, the gene 
signature of the cell of origin may elucidate key molecular pathways 
and driver mutations that could lead to new therapeutic approaches to 


prevent or target early-stage disease. m 
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Autophagy in immunity 


and inflammation 


Beth Levine’*”, Noboru Mizushima‘ & Herbert W. Virgin? 


Autophagy is an essential, homeostatic process by which cells break down their own components. Perhaps the 
most primordial function of this lysosomal degradation pathway is adaptation to nutrient deprivation. However, in 
complex multicellular organisms, the core molecular machinery of autophagy — the ‘autophagy proteins’ — orchestrates 
diverse aspects of cellular and organismal responses to other dangerous stimuli such as infection. Recent developments 
reveal a crucial role for the autophagy pathway and proteins in immunity and inflammation. They balance the beneficial 
and detrimental effects of immunity and inflammation, and thereby may protect against infectious, autoimmune and 


inflammatory diseases. 


here is only one known mechanism that eukaryotic cells possess 

to dispose of intracellular organelles and protein aggregates that 

are too large to be degraded by the proteasome. It is therefore not 
surprising that this mechanism — the lysosomal degradation pathway 
known as autophagy — is also used to degrade microorganisms (such 
as viruses, bacteria and protozoa) that invade intracellularly’”. Indeed, 
the mutation of autophagy genes increases susceptibility to infection 
by intracellular pathogens in organisms ranging from plants to flies 
to worms to mice, and possibly to humans. Perhaps less apparent, but 
teleologically as intuitive, the autophagy pathway or unique functions of 
autophagy proteins also have a central role in controlling other diverse 
aspects of immunity in multicellular organisms. 

The autophagy machinery is thought to have evolved as a stress 
response that allows unicellular eukaryotic organisms to survive dur- 
ing harsh conditions, probably by regulating energy homeostasis and/ 
or by protein and organelle quality control. The same machinery might 
therefore be expected to diversify functionally in complex metazoan 
organisms, so as to regulate new layers of defences used by multicellular 
organisms to confront different forms of stress. A plethora of genetic, 
biochemistry, cell biology, systems biology and genomic studies have 
recently converged to support this notion. The autophagy machinery 
interfaces with most cellular stress-response pathways’, including 
those involved in controlling immune responses and inflammation. 
This interface is not only at the level of the autophagy pathway, but also 
entails direct interactions between autophagy proteins and immune 
signalling molecules*. There is a complex reciprocal relationship 
between the autophagy pathway/proteins and immunity and inflam- 
mation; the autophagy proteins function in both the induction and sup- 
pression of immune and inflammatory responses, and immune and 
inflammatory signals function in both the induction and suppression 
of autophagy. Moreover, similar to cancer, neurodegenerative diseases 
and ageing’, defects in autophagy — through autophagy gene mutation 
and/or microbial antagonism — may underlie the pathogenesis of many 
infectious diseases and inflammatory syndromes. 

In this Review, we describe recent advances in our evolving 
comprehension of the interface between autophagy, immunity and 
inflammation. We discuss how emerging concepts about the functions 


of the autophagy pathway and the autophagy proteins may reshape our 
understanding of immunity and disease. This set of proteins not only 
orchestrates the lysosomal degradation of unwanted cargo, but also 
exerts intricate effects on the control of immunity and inflammation. 
Thus, the autophagy pathway and autophagy proteins may function as 
a central fulcrum that balances the beneficial and harmful effects of the 
host response to infection and other immunological stimuli. 


Mechanisms and membrane dynamics of autophagy 
Autophagy is a general term for pathways by which cytoplasmic mat- 
erial, including soluble macromolecules and organelles, is delivered to 
lysosomes for degradation’. There are at least three different types of 
autophagy, including macroautophagy, chaperone-mediated autophagy 
and microautophagy. Macroautophagy, usually referred to simply as 
autophagy, is the subject of this Review (Fig. 1). In this pathway, a por- 
tion of cytoplasm (usually 0.5-1 «zm in diameter) is engulfed by an 
isolation membrane, or ‘phagophore; resulting in the formation of a 
double-membrane structure known as the autophagosome. The outer 
membrane of the autophagosome fuses with the lysosome to become an 
autolysosome, leading to the degradation of autophagosomal contents by 
lysosomal enzymes. Autophagosomes can also fuse with endosomes or 
multivesicular bodies and major histocompatibility complex (MHC)- 
class-II-loading compartments’. Autolysosomes become larger as more 
autophagosomes and lysosomes fuse, but at a termination phase lyso- 
somes are tubulated and fragmented for renewal’. 

The membrane dynamics of autophagosome formation involve com- 
plex processes, which are not completely understood. Nonetheless, the 
molecular dissection of autophagy membrane dynamics, stimulated by 
the discovery of ATG (autophagy-related) genes in yeast’, has shed con- 
siderable light on this topic (Table 1). Several recent studies suggest that 
the endoplasmic reticulum (ER) is crucial for autophagosome forma- 
tion. The ER cisternae often associate with developing autophagosomes, 
and electron tomography analysis has demonstrated direct connections 
between the ER and autophagosomal membranes'”"". Moreover, the 
function of several key autophagy proteins seems to be at the level of 
the ER (Fig. 1). 

Autophagy is induced by nutrient starvation through the inhibition 
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Figure 1 | Schematic overview of autophagy and its regulation. 
Overview of the autophagy pathway. The top right box shows a model 

of our current understanding of the molecular events involved in 
membrane initiation, elongation and completion of the autophagosome. 
The major membrane source is thought to be the endoplasmic reticulum 
(ER), although several other membrane sources, such as mitochondria 
and the plasma or nuclear membrane, may contribute. After induction 

of autophagy, the ULK1 complex (ULK1-ATG13-FIP200-ATG101) 
(downstream of the inhibitory mTOR signalling complex) translocates 

to the ER and transiently associates with VMP1, resulting in activation of 
the ER-localized autophagy-specific class III phosphatidylinositol-3-OH 
kinase (PI(3)K) complex, and the phosphatidylinositol-3-phosphate 
(PtdIns(3)P) formed on the ER membrane recruits DFCP1 and WIPIs. 
WIPIs and the ATG12—ATG5-ATGI16L1 complex are present on the outer 
membrane, and LC3-PE is present on both the outer and inner membrane 
of the isolation membrane, which may emerge from subdomains of the ER 


of mammalian target of rapamycin (mTOR), resulting in translocation 
of the mTOR substrate complex (ULK1/2, ATG13, FIP200 (also known 
as RB1CC1) and ATG101) from the cytosol to certain domains of the 
ER or closely attached structures’””’. This leads to the recruitment of 
the class III phosphatidylinositol-3-OH kinase (PI(3)K) complex, which 
includes at least VPS34 (also known as PIK3C3), VPS15 (PIK3R4 and 
p150), beclin 1 and ATG14, to the ER’*”*. The PI(3)K complex pro- 
duces phosphatidylinositol-3-phosphate (PtdIns(3)P), which recruits 
effectors such as double FY VE-containing protein 1 (DFCP1) and WD- 
repeat domain phosphoinositide-interacting (WIPI) family proteins. 
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termed omegasomes. The cellular events that occur during autophagy are 
depicted in the bottom diagram, including the major known cellular and 
microbial proteins that regulate autophagy initiation, cargo recognition 
and autophagosome maturation. Only those cellular proteins known to 

be adaptors for targeting microbes are shown; other proteins (not shown) 
also function in cargo recognition of mitochondria and other organelles. 
CMV, cytomegalovirus; DAMP, danger-associated molecular pattern; 
DAP, death-associated protein; EBV, Epstein-Barr virus; HBV, hepatitis B 
virus; HSV-1, herpes simplex virus 1; KSHV, Kaposi’s sarcoma-associated 
herpesvirus; LIR, LC3-interacting region (motif); LPS, lipopolysaccharide; 
MDP, muramyl dipeptide; Pam,Cys,, a synthetic TLR2 agonist; PAMP, 
pathogen-associated molecular pattern; PERK, an eIF2a kinase; PGN, 
peptidoglycan; PRGP-LE, a peptidoglycan-recognition protein; PRR, 
pathogen-recognition receptor; ROS, reactive oxygen species; Ub, 
ubiquitin; UBA, ubiquitin-associated domain; UBZ, ubiquitin-binding zinc 
finger; v-FLICE, viral FLICE. 


DFCP1 is diffusely present on the ER or the Golgi, but translocates to the 
autophagosome formation site in a PtdIns(3)P-dependent manner to 
generate ER-associated (-like structures termed omegasomes’*. Among 
the four WIPI isoforms, WIPI2 is the major form in most cell types and 
functions downstream of DFCP1, and it may promote the development 
of omegasomes into isolation membranes or autophagosomes’®. 

An ER-associated multispanning membrane protein, VMP1, is also 
important for autophagosome formation. Although VMP1 interacts 
with beclin 1 and is present at the autophagosome formation site at 
an early stage, it seems to function at a late stage in autophagy’*””"*. 
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Table 1 | Key proteins involved in mammalian autophagosome formation and their immune functions 


Protein 
complex 


Function of protein complex in autophagy Specific protein General properties 


Immunological/host defence functions 


Nucleation step of autophagosome formation 


ULK This complex is negatively regulated ULK1/2 Protein kinase, Antibacterial*”****; antiviral*©* 
complex by mTORC1 in a nutrient-dependent phosphorylated by mTORC1 
manner. After autophagy induction, 
this complex translocates to early ATG13 Phosphorylated by mTORC1 Unknown 
autophagic structures. Although FIP200 Scaffold for ULK1/2 and Maintains numbers of fetal haematopoietic stem cells’2 
FIP200 and ATG13 are known ATG13 
to be phosphorylated by ULK1, : 
physiologically relevant substrates ATG101 Interacts with ATG13 Unknown 
remain unknown. FIP200 and 
ATG101 may have functions similar 
to yeast Atg17, 29 and 31, although 
they show no sequence similarity 
with these proteins. 
Class III Beclin 1 is negatively regulated VPS34 PI(3) kinase Antiviral*®*; phagosome maturation®? 
PI(3)K by BCL2 and by BCL-x, through 
complex direct binding. This complex vest) Myristoylated HMoknown 
produces PtdIns(3)P, probablyon —_Beclin 1 BH3-only protein, interacts Antibacterial*®**5*; antiviral*®*°”°°; apoptotic corpse 
the ER. VPS34, VPS15 and beclin with BCL2 and BCL-X, clearance™; decreases inflammation in tumours®; regulates 
1 are shared with the UVRAG _ germinal centre induction®; phagosome maturation?!*3"9; 
complex, which seems to function interacts with TLR signalling adaptors? 
in the late endocytic pathway. - : 
Rubicon negatively regulates ATG14 Autophagy-specific subunit Unknown 
autophagosome-lysosome fusion AMBRA1 Interacts with and activates = Unknown 
through interaction with the UVRAG beclin 1 
complex. 
UVRAG A VPS38 homologue; Unknown 
interacts with class C VPS 
(HOPS) complex 
Rubicon Interacts with beclin 1 Unknown 
Others DFCP1 forms an omegasome on ATG2 Interacts with Atg18 in yeast — Antiviral*®* 
a EC Aea Wiecan ATGO Transmembrane protein Antiviral*®*; inhibits innate immune signalling* 
VMP1 are present on the autophagic WIPI1-4 PtdIns(3)P-binding proteins © Unknown 
membrane. ATG9 also exists in other ae : 
compartments such as endosomes DFCP1 PtdIns(3)P-binding ER protein Unknown 
and the Golgi apparatus. VMP1 Transmembrane protein Unknown 
Elongation step 
ATG12- The ATG12-ATG5-ATG16L1 dimer is ATG12 Ubiquitin-like, conjugates to —_Antiviral*®*; antibacterial**; antigen presentation’**”*; inhibits 
conjugation important for LC3-PE conjugation. ATG5 type | IFN production” 
aye ae pevlleantie sepia is seated ATG7 E1-like enzyme a ori erae tea ra tere 
is essential for proper elongation of Pp ae Oso: meture 1h) ‘ malas per a fae os 
éhe isolation:membrane. intestinal immune epithelial cell function”’; inhibits type | IFN 
production’®; inhibits pro-inflammatory cytokine production® 
ATG10 E2-like enzyme Unknown 
ATG5S Conjugated by ATG12 Antiviral*®4°*; antibacterial!3°%547*1: antiparasitic®®; antigen 
presentation®’**’3; phagosome maturation*!; apoptotic 
corpse clearance®™; maintains number of T cells**’?; maintains 
number of Bla B cells™; intestinal immune epithelial cell 
function”; inhibits type | IFN production’””®; increases type | 
IFN production’® 
ATG16L1 Homodimer, interacts with Antibacterial®’**; antigen presentation”; Crohn’s disease risk 
ATG5 allele®; intestinal immune epithelial cell function®!*°; inhibits 
pro-inflammatory cytokine production® 
LC3- The formation of LC3-PE LC3 Ubiquitin-like, conjugates Antiviral*®*; antibacterial***; antigen presentation’; 
conjugation conjugates and their (GATE16, to PE adaptor for substrates of selective autophagy of microbes 
system deconjugation by ATG4 is GABARAP) (xenophagy)*841! 
important for isolation membrane ; Bie 
elongation and/or complete ATG4A-D a ar ala sila Antiviral*** 
closure. LC3 is present on both Yar olas’: GECOniUeaung: 
the inner and outer membrane nee 
of the autophagosomes, and also ATG7 E1-like enzyme Antiviral******. antibacterial***; antigen presentation®*; 
serves as an adaptor for selective phagosome maturation*'; maintains number of T cells**’?; 
substrates such as p62, NBR1, intestinal immune epithelial cell function®°; inhibits type | IFN 
NDP52 and the yeast mitophagy production”®; inhibits pro-inflammatory cytokine production®? 
protein Atg32. : ~~ 
ATG3 E2-like enzyme Antiviral** 


The components of the autophagy machinery that have been shown to participate in the immune and inflammatory processes depicted in Fig. 3. Owing to space limitations, primary papers are cited only for 
those citations also included in the main text; otherwise, see references contained within cited review articles. 
*Function observed in model organism (for example, Dictyostelium discoideum, Nicotiana benthamiana, Arabidopsis thaliana, Drosophila melanogaster or Caenorhabditis elegans). 
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This is perhaps consistent with other evidence that beclin 1-class III 
PI(3)K complexes may function in autophagosomal maturation (in 
addition to vesicle nucleation), a process that can be regulated by 
other beclin-1-interacting partners such as rubicon (Table 1). At the 
final step of autophagosome formation, elongation of the isolation 
membrane and/or completion of enclosure require two ubiquitin-like 
conjugates. The first is the ATG12-ATG5 conjugate, which is produced 
by the ATG7 (E1-like) and ATG10 (E2-like) enzymes, and functions 
as a dimeric complex together with ATG16L] (ref. 19) . The second is 
the phosphatidylethanolamine (PE)-conjugated ATG8 homologues — 
LC3, GATE16 and GABARAP — which are produced by the ATG7 
and ATG3 (E2-like) enzymes”. Although the proteins involved in 
autophagosome membrane formation have been characterized as dis- 
crete complexes (Table 1), several potential interconnections between 
components of the different complexes were identified by a recent 
proteomics study”. Such interconnections may function in autophago- 
some membrane formation or other distinct cellular functions. For 
example, the ATG12-ATG3 conjugate is implicated in mitochondrial 
homeostasis but not in autophagosome membrane formation”. 

In addition to the ER, other membranes may be involved in autophago- 
some formation (Fig. 1). ATG9, another multispanning membrane pro- 
tein, is essential for autophagy” and traffics between the trans-Golgi 
network, endosomes and autophagosome precursors™. Studies suggest 
that mitochondria, the plasma membrane and the nuclear membrane 
could also be membrane sources for autophagosome formation”. 
However, the lack of detection of specific protein markers for these 
structures on the autophagosomal membrane leaves the decades-old 
question of the membrane source of the autophagosome unanswered. 
It is possible that cells may use different membrane sources to form the 
autophagosome in different contexts, thereby permitting specialization 
of membrane dynamics in a manner that allows divergent autophagy- 
inducing signals to stimulate the capture of spatially distinct cargo. 


Selective autophagy tackles microbes 

Autophagy was originally considered to be a non-selective bulk deg- 
radation process, but it is now clear that autophagosomes can degrade 
substrates in a selective manner”. In addition to endogenous substrates, 
autophagy degrades intracellular pathogens in a selective form of 
autophagy, termed xenophagy. Similar to bulk autophagy (such as that 
induced by nutrient deprivation) and other forms of selective autophagy 
(such as degradation of damaged mitochondria, peroxisomes, aggre- 
gate-prone proteins or damaged ER), the precise membrane dynamics 
and specificity determinants of xenophagy are not fully understood. 
Nonetheless, considerable advances have been made, and interesting 
similarities and differences are beginning to emerge between cellular 
recognition and degradation of self versus foreign microbial compo- 
nents through autophagy-like pathways (Figs 1 and 2). 

The vacuoles used for the engulfment of intracytoplasmic bact- 
eria are similar to autophagosomes, and their formation requires the 
core autophagy machinery. But one apparent difference is the vacuole 
size; for example, the diameter of group A Streptococcus-containing 
autophagosome-like vacuoles (GcAV) can be as big as 10 um. These 
large GcAVs are generated by the RAB7-dependent fusion of small iso- 
lation membranes”. By contrast, the formation of starvation-induced 
autophagosomes requires RAB7 later in the autophagy process, at the 
autophagosome-lysosome fusion step. 

A more complex question is how autophagosomes (or components 
of the autophagy pathway) capture pathogens that are inside vacu- 
olar compartments (Fig. 2). There are at least four general pathways 
that may be used for autophagy-protein-dependent targeting of bac- 
teria to the lysosome. These include autophagy-protein-facilitated 
fusion of bacteria-containing phagosomes with lysosomes, the 
envelopment of bacteria-containing phagosomes or endosomes by 
autophagosomal membranes, the fusion of bacteria-containing phago- 
somes or endosomes with autophagosomes, or the xenophagic capture 
of bacteria that have escaped inside the cytoplasm. In some cases, the 
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route of autophagy-dependent targeting to the lysosome has been well 
defined, such as for group A Streptococcus that escapes from endo- 
somes”. For several bacteria, however, the precise route is unclear. Many 
studies define bacterial autophagy as the co-localization of bacteria and 
LC3, but we now know that LC3 can decorate membranous compart- 
ments other than autophagosomes (including phagosomes). 

Several lines of evidence suggest that autophagy proteins function 
more broadly, not only in classical macroautophagy, but also in the 
process of phagolysosomal maturation during antigen presentation and 
microbial invasion. Autophagy proteins are required for the fusion of 
phagosomes that contain Toll-like receptor (TLR)-ligand-enveloped 
particles with lysosomes in macrophages”, and for the fusion of phago- 
somes that contain TLR-agonist-associated apoptotic cell antigens with 
lysosomes in dendritic cells during MHC class II antigen presentation”. 
The self ligand and cell-surface receptor SLAM functions as a microbial 
sensor that recruits the beclin 1-class HI PI(3)K complex to phagosomes 
containing Gram-negative bacteria, facilitating phagolysosomal fusion 
and activation of the antibacterial NADPH oxidase (NOX2) complex”. 
In addition, the engagement of TLR or Fcy receptors during phago- 
cytosis recruits LC3 (and ATG12) to the phagosome through NOX2- 
dependent generation of reactive oxygen species (ROS)*. Thus, in 
bacterial infections, a paradigm is emerging in which the coordinated 
regulation of microbial sensing, phagolysosomal maturation and anti- 
bacterial activity involves the recruitment of autophagy proteins to the 
phagosome. Asa corollary, an interesting speculation is that impaired 
recruitment of autophagy proteins to the phagosome may contribute to 
the pathogenesis of chronic granulomatous disease, a genetic disorder 
caused by mutations in the NOX2 gene (also known as CYBB) and char- 
acterized by recurrent bacterial and fungal infections and inflammatory 
complications, such as inflammatory bowel disease. 

Another autophagosome-independent function of autophagy proteins 
in pathogen destruction has been described in interferon-y (IFN-y)- 
treated macrophages infected with the parasite Toxoplasma gondii. The 
parasite-derived membrane, termed the parasitophorous vacuole, under- 
goes destruction through a mechanism that involves ATG5-dependent 
recruitment of the immunity-related GTPase proteins to the parasitopho- 
rous vacuole***, leading to the death of the parasite in the infected cell*?”, 
Together, these studies suggest that autophagy proteins have diverse roles 
in membrane dynamics to benefit the host in the removal of invading 
pathogens (Fig. 2), through xenophagy, phagolysosomal maturation, the 
recruitment of molecules that damage pathogen-derived membranes, and 
presumably, many other as yet undiscovered mechanisms. 

The mechanisms that cells use to target intracellular bacteria (and 
probably viruses) to autophagosomal compartments are notably similar 
to those used for selective autophagy of endogenous cargo. Cellular 
cargo is commonly targeted to autophagosomes by interactions between 
a molecular tag (such as polyubiquitin), adaptor proteins such as p62 
(also known as SQSTM1 or sequestome 1) or NBR1 (which recognize 
these tags and contain an LC3-interacting region (LIR) characterized 
by a WXXL or WXXI motif), and LC3 (ref. 28). These adaptor mol- 
ecules enable autophagy to target designated cargo selectively to nascent 
LC3-positive isolation membranes. As reviewed elsewhere’, a similar 
mechanism involving ubiquitin and p62 seems to be involved in the 
targeting of intracellular bacteria, such as Salmonella enterica serotype 
Typhimurium (S. Typhimurium), Shigella flexneri and Listeria mono- 
cytogenes, to autophagosomes. 

After escape into the cytoplasm or in vacuolar membrane compart- 
ments damaged by type III secretion system (T3SS) effectors, bacteria 
or bacteria-containing compartments, respectively, may become coated 
with ubiquitin and associate with p62 and nascent LC3-positive isolation 
membranes. The autophagosomal targeting of Salmonella also requires 
another cellular factor, NDP52 (nuclear dot protein 52), an autophagy 
adaptor protein that, like p62, contains an LIR and ubiquitin-binding 
domains and restricts intracellular bacterial replication. A ubiquitin- 
independent pathway (that does not involve p62 or NDP52) could also 
function in targeting damaged Salmonella-containing vacuoles (SCVs) 
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Figure 2 | Possible autophagy-protein-dependent pathways of pathogen 
degradation. Possible pathways involving the autophagy machinery by 
which viruses, bacteria (and damaged membranes of bacteria-containing 
vacuoles) and parasites may be targeted to the lysosome. Adaptor refers 


to the autophagosome. In this pathway, a lipid second messenger, diacyl- 
glycerol, acts as a signal for the co-localization of SCVs with LC3-pos- 
itive autophagosomes by a mechanism that involves protein kinase C 
and its downstream targets, JNK and NADPH oxidase”. The autophagic 
targeting of a cytoplasmic positive-strand RNA virus, Sindbis virus, also 
occurs in a ubiquitin-independent manner, but involves the interaction 
of p62 with the viral capsid protein*’. Thus, diverse molecular strategies, 
including ubiquitin-dependent and -independent mechanisms, may be 
used to target microbes inside the cytoplasm or vacuolar compartments 
to the autophagosome. 

Beyond targeting intracellular pathogens for degradation, p62 may 
have further beneficial effects in infected host cells. For example, Shigella 
vacuolar membrane remnants generated by bacterial T3SS-dependent 
membrane damage are targeted by polyubiquitination, p62 and LC3 for 
autophagosomal degradation” (Fig. 2). These membrane remnants also 
accumulate numerous molecules involved in sensing and transduction 
of pathogen-associated molecular pattern (PAMP) and danger-asso- 
ciated molecular pattern (DAMP) signals, and there is an increase in 
nuclear factor-«B (NF-«B)-dependent cytokine production, ROS pro- 
duction and necrotic cell death in autophagy-deficient cells. Thus, the 
ubiquitin—p62-dependent autophagic targeting of pathogen-damaged 
membranes could help to control detrimental downstream inflam- 
matory signalling during bacterial invasion into host cells. Another 
emerging concept is that selective autophagy of viral proteins, similar 
to selective autophagy of aggregate-prone toxic cellular proteins, may 
protect post-mitotic cells such as neurons against cell death. For exam- 
ple, in Sindbis-virus-infected mice with neuron-specific inactivation 


Autolysosome 


Phagolysosome 


to the proteins shown in the cargo-recognition box in Fig. 1; however, as 
yet undiscovered adaptors may be involved in pathogen recognition, and 
pathogen targeting may involve ubiquitin-dependent or -independent 
mechanisms. 


of Atg5, there is an accumulation of Sindbis virus antigens (without 
increased levels of infectious virus), increased neuronal cell death and 
increased animal mortality”. Moreover, p62 is required for starvation 
and IFN-y-induced targeting of Fau (and perhaps other ubiquitylated 
protein complexes) to mycobacteria-containing phagosomes, resulting 
in the generation of antimycobacterial Fau-derived peptides”. The role 
of p62 in innate immunity is probably evolutionarily ancient, as the 
Drosophila p62 orthologue REF(2)P was originally identified in a screen 
for modifiers of sigma virus replication”. 

We speculate that p62, as well as the other known LC3-interacting 
adaptor proteins (NBR1 and NDP52), may represent the tip of the ice- 
berg in terms of cellular adaptor proteins that bind to ubiquitin (or other 
molecular tags) and target microbial substrates and cytosolic material 
to autophagosomes to coordinate innate immune responses. A recent 
proteomics study showed that the mammalian ATG8 family, which 
includes LC3, GATE16 and GABARAP, has 67 high-confidence inter- 
actions with other cellular proteins”. Some of these new ATG8-family- 
member-interacting partners may have an as yet undiscovered role in 
innate immunity. Another open question is whether the known proteins 
involved in selective autophagy of mitochondria (called mitophagy), 
such as Nix (also known as BNIP3L) and parkin™, also function in 
microbial autophagy. 


Autophagy and resistance to infection 

The autophagy pathway and/or autophagy proteins have a crucial role 
in resistance to bacterial, viral and protozoan infection in metazoan 
organisms. The genetic deletion or knockdown of autophagy genes 
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protects plants from viral, fungal and bacterial infection by prevent- 
ing the uncontrolled spread of programmed cell death during the 
plant innate immune or hypersensitive response”. In other organisms, 
autophagy proteins function in a cell-autonomous manner to control 
infection by intracellular pathogens. In Drosophila, autophagy gene 
mutation increases susceptibility to viral (vesicular stomatitis virus)** 
and bacterial (L. monocytogenes)” infection. In Dictyostelium and 
Caenorhabditis elegans, autophagy gene mutation increases suscepti- 
bility to lethal S. Typhimurium infection®. In mice, knockout of Atg5 
in macrophages and neutrophils increases susceptibility to infection 
with L. monocytogenes and the protozoan T. gondii®, and neuron-spe- 
cific Atg5 knockout increases susceptibility to central nervous system 
Sindbis virus infection®’. As noted in the next section, the autophagy 
pathway and proteins may also have ‘proviral’ or ‘probacterial’ effects 
in in vitro studies; however, in vivo evidence for such effects is so far 
lacking. The mechanisms by which autophagy genes mediate in vivo 
resistance to infection are not fully understood, but are likely to involve a 
combination of xenophagy, other autophagy-protein-dependent effects 
on microbial replication or survival, activation of innate and adaptive 
immune responses, and/or alterations in pathogen-induced cell death 
(Fig. 3). 

An important question is whether this function of autophagy in broad 
resistance to infection with intracellular pathogens extends to humans. 
Recent human genetic studies provide some clues. The immunity- 
related GTPase human IRGM, which regulates autophagy-dependent 
clearance of mycobacteria in vitro”, was identified as a genetic risk locus 
for tuberculosis in a West African population”. Numerous studies have 
shown a crucial role for autophagy in defence against mycobacterial 
infection in human cells’, and a genome-wide analysis of host genes 
that regulate Mycobacterium tuberculosis replication demonstrated 
that a predominance of factors were autophagy regulators”’. Thus, it 
is possible that autophagy has a central role in resistance to one of the 
most important global infectious diseases — tuberculosis. Mutations in 
NOD2, which encodes an intracellular pathogen-recognition receptor 
(nucleotide-binding oligomerization-domain-containing protein 2) that 
functions in bacterial autophagy”, are also associated with susceptibil- 
ity to infection with another mycobacterial agent, Mycobacterium leprae, 
the aetiological agent of leprosy™. An exciting future venture will be to 
confirm whether IRGM, NOD2 and other autophagy-related genes are 
involved in resistance to infection with mycobacteria and other infec- 
tions in further human populations and, if so, whether this resistance 
is mediated by autophagy. 


Microbes fight back 

Microbes undergo strong selective pressure to develop strategies to block 
host defence mechanisms; the number of such strategies is a surrogate 
measure of the importance of the host defence mechanism in immu- 
nity. As reviewed elsewhere’, viruses and intracellular bacteria have 
evolved several ways to adapt to host autophagy. They can antagonize 
autophagy initiation or autophagosomal maturation, evade autophagic 
recognition, or use components of the autophagy pathway to facilitate 
their own replication or intracellular survival. An emerging theme is 
that microbial antagonism of autophagy not only blocks the xenophagic 
degradation of intracellular pathogens, but also blocks the functions of 
autophagy in innate and adaptive immunity. A relatively unexplored yet 
crucially important frontier is how microbial antagonism may contrib- 
ute more broadly to the role of microbes in diseases characterized by 
defective autophagy, such as cancer, neurodegenerative diseases, ageing 
and, potentially, autoimmune and inflammatory diseases. 

Viral strategies to shut off autophagy include the blockade of posi- 
tive upstream regulators of autophagy (such as the IFN-inducible 
RNA-activated eIF2a protein kinase (PKR) signalling pathway), the 
activation of negative upstream regulators of autophagy (such as the 
nutrient-sensing TOR kinase signalling pathway) or direct antagonism 
of the autophagy machinery”. The overlapping functions of the eIF2a 
kinase signalling pathway in stress-induced general translational arrest, 
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Figure 3 | Functions of the autophagy pathway and/or proteins in 
immunity. A summary of the known functions of the autophagy pathway 
and/or proteins in adaptive and innate immunity, and as effectors during 
infection. 


transcriptional activation of stress-response genes and stress-induced 
autophagy enable viruses to disarm several facets of the cellular stress 
response to infection by one mechanism — that is, antagonism of eIF2a 
kinase signalling. The mTOR signalling pathway has a central role in the 
control of cell growth and metabolism, and interestingly, many of the 
viruses that activate mTOR are oncogenic (for example, Epstein-Barr 
virus, Kaposi’s sarcoma-associated herpesvirus, hepatitis B virus and 
retroviruses). This suggests another type of pluripotent viral weapon 
— one that can promote oncogenesis by simultaneously inactivating 
autophagy and promoting cell growth through TOR activation. HIV 
envelope protein-dependent activation of mTOR signalling is also 
proposed to be a mechanism for HIV evasion of innate and adaptive 
immune responses in dendritic cells, including the degradation of 
incoming virions by lysosomes, blockade of HIV transfer to CD4* T 
cells, stimulation of TLR4 and TLR8 ligand responses, and presentation 
of HIV Gag antigen to CD4"* T cells”. It will be important to determine 
whether these effects of HIV-mediated mTOR activation and autophagy 
inhibition contribute to impaired dendritic cell function during HIV 
infection in vivo. 

Several viral proteins target the core autophagy protein beclin 1. 
Autophagosome initiation is blocked by interactions between beclin 1 
and the herpes simplex virus 1 (HSV-1) neurovirulence factor ICP34.5 
or the oncogenic y-herpesvirus-encoded viral BCL2-like proteins, 
whereas autophagosome maturation is blocked by interactions between 
beclin 1 and the HIV accessory protein Nef or the influenza virus matrix 
protein 2 (ref. 55). The interactions between beclin 1, HSV-1 ICP34.5 
and viral BCL2 are probably physiologically important in vivo; a mutant 
HSV-1 virus lacking the beclin-1-binding domain of ICP34.5 is attenu- 
ated in mouse models of encephalitis (presumably through its failure 
to control xenophagy and innate immunity)” and of corneal disease 
(through its failure to control adaptive immunity)”. Moreover, a mouse 
y-herpesvirus that encodes a mutant viral BCL2 unable to bind to 
beclin 1 demonstrates impaired ability to maintain chronic infection”. 
Thus, viral antagonism of host autophagy can manipulate distinct 
aspects of viral pathogenesis and immunity in vivo. 

It is not yet clear whether compared with other autophagy proteins, 
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beclin 1 is preferentially targeted by viral virulence proteins because of 
its central role in autophagosome formation, or more likely, whether we 
are just beginning to identify pairs of viral proteins and their autophagy 
pathway targets. In support of the latter, viral FLICE-like inhibitors 
encoded by Kaposi's sarcoma-associated herpesvirus and molluscum 
contagiosum virus suppress autophagy by interacting with the ATG3 
E2-like enzyme, thereby preventing it from binding and processing LC3 
(ref. 60). 

Bacteria possess diverse strategies to avoid degradation by 
autophagolysosomal pathways. As reviewed elsewhere’”*, many bac- 
teria that reside in phagosomes or other vacuolar compartments have 
methods to inhibit lysosomal fusion or maturation, which, in the case 
of mycobacteria, can be partially overcome by treatments that stimu- 
late autophagy. Another possible mechanism for bacterial evasion of 
autophagy has emerged from a genome-wide screen to identify host 
factors that regulate the intracellular survival of M. tuberculosis”. 
According to bioinformatics analyses, M. tuberculosis infection of 
human macrophage-like cells activates cellular pathways that inhibit 
autophagy. Intracellular bacteria that escape into the cytoplasm, such 
as S. flexneri and L. monocytogenes, use strategies to camouflage them- 
selves to avoid autophagic recognition. The Shigella T3SS effector IcsB 
competitively binds to VirG, thereby preventing the interaction between 
ATG5 and VirG, a bacterial surface protein required for actin-based 
motility and Shigella targeting to autophagosomes”. The Listeria protein 
ActA interacts with cytosolic actin polymerization machinery (ARP2/3, 
VASP and actin), which blocks bacterial association with ubiquitin, p62 
recruitment and autophagic targeting”. The precise mechanism of this 
block is unknown, but it has been proposed that the ActA-dependent 
recruitment of host cell cytoskeletal proteins may enable the bacterium 
to disguise itself as a host cell organelle”. This concept sheds light on 
the autophagy pathway in a fundamental aspect of immunology — the 
basis for discrimination between self and non-self. 

Microbes have evolved not only to antagonize autophagy (asa cellular 
defence mechanism that threatens their survival), but also to exploit 
its components and functions in membrane trafficking for their own 
self-serving purposes”. An early concept in the field is that autophago- 
somes may serve as a protected niche for intracellular bacteria (provided 
fusion with acidic compartments is blocked) and/or serve as a source 
of nutrients for intracellular pathogens (which would require intact 
autophagolysosomal fusion)". Trafficking of the intracellular bacteria 
Yersinia pseudotuberculosis to acidic compartments was recently shown 
to be enhanced by genetic inhibition of autophagy”. This seemingly 
contradicts other evidence that the autophagy proteins promote phago- 
somal maturation, but is consistent with the concept that autophago- 
somes function as a protected intracellular niche for bacteria. The role 
of the autophagy machinery in promoting and/or inhibiting vacuolar 
acidification — and the counter effects of microbes that reside in vacu- 
olar compartments — needs to be explored further. 

The function of autophagy proteins in membrane formation and/ 
or trafficking is exploited by numerous viruses, including poliovirus, 
rotavirus, HIV, coronaviruses, Dengue virus, and the hepatitis B and C 
viruses”. Autophagosomes (defined as LC3-positive membranes, see 
caveat below) may act as a scaffold for intracellular membrane-associ- 
ated replication of certain cytoplasmic RNA viruses”. Autophagy may 
assist in HIV biogenesis, because the processing of the HIV envelope 
precursor protein Gag and extracellular viral release are enhanced by the 
autophagy machinery”. Similarly, autophagy proteins are required for 
maximal levels of poliovirus egress”. Another newly defined proviral 
function of autophagy is its role in productive hepatitis C virus replication; 
several different autophagy proteins (such as beclin 1, ATG4B, ATGS, 
ATG7 and ATG12) assist in the translation of incoming, but not progeny, 
viral RNA™. ATG7 and class III PI(3)K activity also enhance hepatitis B 
virus DNA replication®. 

The mechanisms by which autophagy proteins facilitate the replication 
and/or egress of certain viruses are not yet understood. Some observa- 
tions may relate to the role of autophagy proteins in remodelling the ER 
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(vis-a-vis viral replication) or the role of autophagosomes in fusing with 
multivesicular bodies (vis-a-vis viral egress). It is possible that autophagy 
proteins function to provide membrane for viral replication complexes 
or translation machinery. This may be true for viruses such as hepatitis C 
virus, for which genetic knockdown of several different autophagy genes 
decreases productive replication™. However, for other viruses such as 
coronaviruses, the biogenesis of double-membrane, ER-derived vesicles 
used for replication proceeds through a pathway that involves the non- 
lipidated form of LC3 (LC3-I) but not the general autophagy machinery”. 
Thus, caution must be exercised in interpreting the significance of the 
co-localization (or biochemical interaction) of viral proteins and LC3, as 
LC3 may have autophagy-independent roles in membrane dynamics. 


Autophagy regulation by immune signalling molecules 

The central importance of autophagy in immunity is further under- 
scored by the multitude of immune-related signalling molecules that 
regulate autophagy. As reviewed in detail elsewhere”*”*, autophagy is 
induced by different families of pathogen-recognition receptors (such 
as TLRs, NOD-like receptors and the double-stranded RNA-binding 
protein PKR), DAMPs (such as ATP, ROS and misfolded proteins), 
pathogen receptors (such as CD46), IFN-y and downstream immunity- 
related GTPases, and DAP kinase, JNK, CD40, tumour necrosis factor-a 
(TNF-a), inhibitor of NF-«B (IKK) and NF-«B (Fig. 1). High mobil- 
ity group box (HMGB) proteins have also been shown to function as 
both universal sensors of nucleic acids in innate immune signalling” 
and inducers of autophagy®. Autophagy is inhibited by BCL2, NF-«B, 
T helper 2 (T,,2) cytokines and the canonical nutrient-sensing insulin- 
AKT-TOR pathway. Inactivation of this nutrient-sensing pathway may 
contribute to vesicular stomatitis virus stimulation of autophagy in Dro- 
sophila*®, and autophagy activation in C. elegans with loss-of-function 
mutations in this pathway may mediate pathogen resistance in long-lived 
mutant nematodes”. Thus, both ‘immune-specific and more general 
nutrient-response signals control autophagy in response to infection. 

Studies with vitamin D3 have uncovered a possible link between 
nutrition, innate immunity and the control of autophagy during myco- 
bacterial infection. Low vitamin D3 levels are associated with increased 
susceptibility to tuberculosis. Vitamin D3 generates an antimycobacte- 
rial peptide, cathelicidin, and induces autophagy and mycobacterial 
killing in human monocytes through cathelicidin-dependent effects”. 
Although cathelicidin is required for vitamin-D3-dependent transcrip- 
tional upregulation of autophagy genes such as BECN1 and ATGS, and 
vitamin D3 enhances the recruitment of cathelicidin to autophago- 
somes, it is not yet clear how cathelicidin promotes autophagy. None- 
theless, these observations may begin to provide some insight into the 
century-old Nobel prize award (Niels Ryberg Finsen, 1903) for the use 
of ultraviolet-light therapy (which generates active vitamin D3) in the 
treatment of diseases such as tuberculosis. 

In most instances, the mechanisms of autophagy control by immune- 
related signalling molecules are not understood. However, there are 
some examples of specific interactions between immune signals and 
autophagy proteins that may be relevant to these mechanisms. For 
example, the interaction between beclin 1 and BCL2 (which inhibits 
its activity) is thought to be disrupted by the TLR adaptors MyD88 
and TRIE, as well as by HMGB1, which bind to beclin 1 and displace 
BCL2 (refs 3, 68). Two intracellular sensors responsible for inducing 
autophagy in response to bacterial infection, NOD1 and NOD2, inter- 
act with ATG16L1 and recruit it to the plasma membrane, resulting in 
enhanced association of invasive bacteria (S. flexneri) with LC3 (ref. 53). 
Interestingly, a NOD2 mutation associated with Crohn’s disease impairs 
ATGI6L1 plasma membrane recruitment and bacterial co-localization 
with LC3 (ref. 53). 

The identification of other possible protein-protein interactions 
between core autophagy proteins and immune signals by a large 
proteomics screen”’ has the potential to foster further advances in 
understanding how different immune signals regulate the autophagy 
machinery. For example, tectonin proteins with multivalent 
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6-propeller folds are known to function in pathogen recognition and 
innate immunity in invertebrates”. Thus, the interactions between 
previously uncharacterized human proteins of this tectonin family, 
TECPRI1 and TECPR2, with the ATG5-ATG12-ATGI16L1 complex 
and ATG8 family members, respectively”, may contribute to patho- 
gen-induced autophagy stimulation or selective autophagic targeting 
of pathogens in mammals. 

Further links between immune signalling molecules and autophagy 
regulation were suggested by a genome-wide short interfering RNA 
screen”. The analysis identified 219 genes that suppressed basal 
autophagy, largely in a mammalian TOR complex 1 (mTORC1)- 
independent fashion. These included several cytokines such as CLCF1, 
LIF, IGF1, FGF2 and the chemokine SDF1 (also known as CXCL12), 
as well as cellular signalling molecules regulated by cytokines such as 
STATS. These findings raise the possibility that cytokines may have 
a broader role in the control of autophagy than previously thought. 
Moreover, because these cytokine signalling pathways are important 
in immune cells, another central question is to what extent cytokine- 
mediated regulation of autophagy governs immune cell function. Given 
the general function of autophagy in cellular homeostasis’, and the more 
specific functions in regulating immune and inflammatory signalling 
(discussed in ‘Regulation of immune signalling by autophagy proteins’), 
cytokine-mediated changes in autophagy levels in immune cells may 
have a central role in immunity and inflammation. 


Autophagy and adaptive immunity 

Autophagy proteins function in adaptive immunity, including in the 
development and homeostasis of the immune system and in antigen 
presentation (Table 1 and Fig. 3). The knockout of different autophagy 
genes in specific lymphocyte populations in mice has shown a crucial 
role for autophagy proteins in the maintenance of normal numbers of 
BlaB cells, CD4* T cells, CD8* T cells and fetal haematopoietic stem 
cells*“*”. In T cells, in which mitochondrial numbers are developmen- 
tally regulated during the transition from thymocyte to mature circu- 
lating T cell, the developmental defect in autophagy- deficient cells may 
be related to the defective clearance of mitochondria“. Another cru- 
cial function of autophagy in the development and homeostasis of the 
immune system is the elimination of autoreactive T cells in the thymus™. 
High levels of autophagy are present in thymic epithelial cells, in which 
autophagy participates in the delivery of self-antigens to MHC class II 
loading compartments. Genetic disruption of Atg5 in thymic epithelial 
cells leads to the altered selection of certain MHC class II restricted 
T-cell specificities and autoimmunity”. Beyond these functions in 
lymphocyte survival and thymic negative selection, autophagy may 
exert other functions in lymphocyte differentiation, perhaps, in part, 
indirectly through effects on cytokine expression (see the next section). 
It is not yet known whether autophagy is involved in the development 
and/or homeostasis of immune cell populations other than lymphocytes 
and haematopoietic stem cells. 

Autophagy proteins may participate in different facets of antigen 
presentation, including the delivery of endogenous antigens for MHC 
class II presentation to CD4° T cells”*”, the enhancement of antigen 
donor cell cross-presentation to CD8* T cells”, dendritic cell cross- 
presentation of phagocytosed antigens to CD4* T cells” and, in one 
report, MHC class I presentation of intracellular antigens to CD8* 
T cells”. The discovery that autophagosomes could deliver endogenous 
antigens to MHC class II loading compartments sheds light on one 
of the central mysteries of antigen presentation — how the immune 
system elicits CD4* T-cell responses to antigens that originate in all 
parts of the cell. The autophagic delivery of endogenously synthesized 
antigens for MHC class II presentation has been demonstrated in vitro 
for certain viral antigens”, and probably explains the essential role of 
Atg5 in vivo in negative thymic selection’’. However, the relative impor- 
tance of this pathway in antigen presentation during infection in vivo is 
not yet known. There is nonetheless interest in exploiting this pathway 
for optimizing vaccine-elicited CD4* T-cell responses, by either pre- 
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treating dendritic cells with autophagy-inducing agents in cell-based 
vaccine strategies or fusing antigens with LC3 to enhance their targeting 
to autophagosomes’. 

Of note, autophagy proteins are required for antigen cross-presen- 
tation during infection in vivo”. Dendritic-cell-specific deletion of 
Atg5 in mice results in defects in priming CD4* T-cell responses after 
HSV and Listeria infections, and mice succumb more rapidly to lethal 
disease after intravaginal HSV infection. Afg5-deficient dendritic cells 
have normal migration, innate responses, endocytic and phagocytic 
activity and cross-presentation of peptides on MHC class I molecules. 
However, they exhibit defects in phagosome-to-lysosome fusion and 
in cross-presentation by MHC class II molecules of phagocytosed anti- 
gens containing TLR ligand. Thus, the interior of the phagosome, like 
that of the autophagosome, is a cellular compartment that autophagy- 
protein-dependent antigen presentation accesses to generate peptides 
for presentation to CD4" T cells. A potential evolutionary advantage of 
this autophagy-protein-dependent cross-presentation is that, by del- 
egating antigen presentation duties to uninfected dendritic cells, the 
host can bypass the blockade of antigen presentation that may result 
from microbial antagonism of autophagy in infected cells. 


Regulation of immune signalling by autophagy proteins 
In response to infection, the host must activate those arms of the innate 
and adaptive immune system (including autophagy-dependent func- 
tions; Fig. 3) that help to control infection while, in parallel, triggering 
specific responses that limit detrimental, uncontrolled immune activa- 
tion and inflammation. An exciting new frontier in autophagy research 
is the growing recognition of the function of autophagy proteins in 
achieving this balance (Fig. 4). 

Autophagy proteins function in both the activation and inactiva- 
tion of innate immune signalling’. The autophagy pathway activates 
type IIFN production in plasmacytoid dendritic cells by delivering viral 
nucleic acids to endosomal TLRs”*. By contrast, autophagy proteins 
negatively regulate RIG-I-like receptor (RLR)-mediated induction of 
type I IFN production through the autophagic elimination of damaged 
mitochondria (and reduction of ROS)”, and by the binding of ATG5- 
ATGI12 to caspase recruitment domains of RLR signalling molecules”. 
Moreover, the autophagy protein ATG9A, but not ATG7, negatively 
regulates the activation of STING, a transmembrane protein that is 
required for efficient activation of type I IFN and pro-inflammatory 
cytokine production in response to stimulatory DNA”. Thus, it seems 
that autophagy proteins can negatively regulate IFN production by both 
autophagy-dependent and -independent mechanisms. With respect 
to the latter, different autophagy proteins may be specialized to target 
different innate immune signalling molecules. 

The autophagy pathway and/or proteins also have a crucial role in 
the control of inflammatory signalling. A major effect is on the regula- 
tion of inflammatory transcriptional responses. Increased levels of the 
adaptor protein p62, which accumulates in autophagy-deficient cells, 
activate the pro-inflammatory transcription factor NF-«B through a 
mechanism involving TRAF6 oligomerization”. The accumulation of 
p62 in Atg7-deficient hepatocytes results in enhanced activity of the 
stress-responsive transcription factor NRF2 and NRF2-dependent liver 
injury®’. In addition, Paneth cells (intestinal immune epithelial cells) 
from mice hypomorphic for Atg16l1 (Atg1611™) show enhanced tran- 
scription of pro-inflammatory cytokines and adipokines”. 

A second important effect of autophagy proteins on inflamma- 
tory signalling is at the level of the inflammasome. This complex 
contains NOD-like receptor cryopyrin proteins, the adaptor protein 
ASC and caspase 1, and is activated by cellular infection or other 
stress to promote the maturation of pro-inflammatory cytokines 
such as interleukin-18 (IL-1) and IL-18 (ref. 4). Atg1611- or Atg7- 
deficient mouse macrophages produce increased levels of mature 
IL-1 and IL-18 after TLR4 stimulation by endotoxin. In addition, 
mouse chimaeras engrafted with Atg16l1 “fetal liver haematopoietic 
progenitors have increased serum concentrations of IL-16 and IL-18 
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Figure 4| Autophagy/autophagy proteins act to achieve a balance between 
activation and inactivation of innate immune signalling. A general 

model in which the levels of autophagy and autophagy proteins control 
disease in response to stressors. Normal autophagy protein function (green) 
contributes to balanced inflammatory and metabolic responses, resulting in 
protection against disease. Altered autophagy protein function (red) results in 
maladaptive inflammatory and metabolic responses, increased inflammation 
and more severe disease. 


after treatment with dextran sodium sulphate (DSS), which contrib- 
utes to increased DSS-induced colitis”. 

The mechanism(s) by which autophagy proteins negatively regu- 
late inflammasome activation are not yet understood. Mutually non- 
exclusive possibilities include direct interactions between autophagy 
proteins and inflammasome components, indirect inhibition of inflam- 
masome activity through autophagic suppression of ROS accumulation, 
or autophagic degradation of danger signals that activate the inflam- 
masome. In line with the latter model, the autophagic degradation of 
amyotrophic-lateral-sclerosis-linked mutant superoxide dismutase has 
been proposed to limit caspase 1 activation and IL-18 production”. 

In addition to regulating inflammatory signalling, the autophagy 
pathway may prevent tissue inflammation through its role in apoptotic 
corpse clearance. The efficient clearance of apoptotic corpses during 
development and tissue homeostasis prevents secondary necrosis, which 
releases danger signals (DAMPs) that trigger inflammation. Autophagy 
genes are essential for the heterophagic clearance of dying apoptotic 
cells during developmental programmed cell death (by the generation 
of ATP-dependent engulfment signals)", and the retinas and lungs of 
embryonic mice lacking Atg5 have a defect in apoptotic corpse engulf- 
ment that is associated with infiltration of inflammatory cells*. On the 
basis of growing evidence that autophagy proteins function in TLR- 
mediated phagolysosomal pathways, it is possible that autophagy also 
functions in phagocytes to facilitate apoptotic corpse clearance. Thus, 
in tissues such as the intestine, in which physiological regeneration 
involves continuous shedding or apoptosis of epithelial cells, autophagy- 
dependent functions in dying cells and/or phagocytic cells may promote 
efficient corpse clearance, thereby limiting inflammation. 


Autophagy and inflammatory disease 

Perturbations in autophagy-protein-dependent functions in immu- 
nity may contribute not only to increased susceptibility to infection, 
but also to chronic inflammatory diseases and autoimmune diseases. 
The only well-characterized link thus far is between mutations in 
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autophagy regulators and Crohn's disease, a chronic inflammatory 
disorder of the small intestine, in which a breakdown in clearance or 
recognition of commensal bacteria, as well as altered mucosal bar- 
rier function and cytokine production, is thought to lead to intestinal 
inflammation (Fig. 5). Other emerging links include the autoimmune 
disease systemic lupus erythematosus (SLE), inflammation-associated 
metabolic diseases such as obesity and diabetes, and inflammation 
associated with cystic fibrosis lung disease (Fig. 4). 

The role of autophagy proteins in Crohn’ disease was not suspected 
until genome-wide association studies identified three Crohn's disease 
susceptibility genes, IRGM, NOD2 and ATG16L1, that are involved in 
autophagy™. The IRGM risk allele contains a deletion in the promoter 
region of the gene that may be associated with changes in IRGM protein 
expression and may contribute to Crohn's disease, given IRGM’s role 
in autophagy-dependent control of bacterial infection”. However, this 
hypothesis has not yet been tested. The three major Crohn’s-disease- 
associated NOD2 variants (a frameshift mutant and two missense 
mutants) may be loss-of-function mutants, with impaired muramyl 
dipeptide (MDP)-induced inflammatory signalling*®. How the loss of 
function of a pro-inflammatory signal mechanistically contributes to 
an inflammatory disorder has been unclear, but the recently discovered 
links between NOD2 and autophagy may solve this conundrum. In 
primary immature human dendritic cells, NOD2 is required for MDP- 
induced autophagy, a process that is essential for the MHC class II pres- 
entation of bacterial antigens to CD4" T cells and for bacterial targeting 
to lysosomes”. Dendritic cells expressing Crohn’s disease NOD2 risk 
variants are defective in both of these functions”. Thus, in patients with 
Crohn's disease and NOD2 risk variants, aberrant autophagy-dependent 
bacterial clearance and immune priming could act as a trigger for intes- 
tinal inflammation. 

A mechanistic link may also exist between ATG16L1 mutation and 
Crohn's disease pathogenesis. Similar to findings with NOD2 vari- 
ants, dendritic cells from patients with the Crohn’s-disease-associated 
ATG16L1(T300A) risk variant are defective in presenting bacterial 
antigen to CD4' T cells”. However, it is not yet known how the T300A 
mutation affects the function of the mammalian ATG16L] protein. This 
mutation resides in the carboxy-terminal WD-repeat domain that is 
absent in yeast Atg16 and is dispensable for autophagy. Although some 
studies have suggested that the ATG16L1(T300A) variant has reduced 
autophagic clearance of enteric pathogens such as adherent-invasive 
Escherichia coli*’ or S. Typhimurium™, it remains controversial whether 
the risk versus protective alleles of ATG16L1 have differences in stability 
or antibacterial autophagic activity”. 

Despite the uncertain nature of the effects of the T300A mutation 
on ATGI6L1 function, Afg1611 mutation (null or hypomorphic alle- 
les) in mice results in abnormalities that are relevant to Crohn’s disease 
pathogenesis. As noted earlier, loss of Atg1611 function in mice results 
in enhanced TLR-agonist-induced pro-inflammatory cytokine produc- 
tion by macrophages”, enhanced DSS-induced colitis””’ and altered 
inflammatory gene transcriptional profiles in Paneth cells*’””. In addi- 
tion, the Paneth cells of mice expressing low Atg16l1 levels (Atg1611'™) 
show defects in the packaging and extrusion of antimicrobial granules 
into the gut lumen; Paneth cells from patients with Crohn’s disease and 
the ATG16L1(T300A) risk variant show similar defects*’. This suggests 
that, in addition to the overlapping functions of NOD2 and ATGI6L1 
in acommon bacterial-sensing pathway that promotes bacterial antigen 
presentation, ATG16L1 may have unique protective functions, includ- 
ing Paneth cell antimicrobial peptide release and the negative regula- 
tion of pro-inflammatory cytokine production. To connect the striking 
phenotypes in Atg16l1-mutant mice and the pathogenesis of Crohn's 
disease in humans with the ATG16L1(T300A) risk allele, the precise 
effects of the T300A mutation on ATGI16L1 protein function need to 
be uncovered. 

A new dimension in understanding the multifactorial basis of 
chronic inflammatory diseases such as Crohn's disease has emerged 
from the discovery that a virus trigger is required to observe intestinal 
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Figure 5 | The link between mutations in autophagy regulators and the 
chronic inflammatory disorder Crohn's disease. An overview of the many 
possible mechanisms by which defects in autophagy and autophagy protein 
function may contribute to the pathogenesis of a type of inflammatory bowel 
disease, Crohn's disease. A micrograph of a human small intestine from a 
patient with Crohn's disease is shown (centre), demonstrating the severe 
transmural inflammation that is characteristic of this disease. The postulated 


abnormalities in Atg1611'™ mice”. In mice raised in a pathogen-free 
facility, only Atg16l1°™ mice (and not wild-type mice) infected with a 
virus found in routine conventional animal facilities, a murine norovi- 
rus, showed abnormal Paneth cell granule secretion, Paneth cell pro- 
inflammatory gene-expression profiles, and intestinal inflammation in 
response to DSS treatment”. This mucosal inflammation depended on 
the presence of the microbiome and pro-inflammatory cytokines, as it 
was reversed by antibiotic treatment or by TNF-a or IFN-y inhibition. 
Thus, variations in a host autophagy gene, exposure to a specific virus 
and the microbiome can act together to trigger intestinal inflammation 
in mice that is similar to that in patients with Crohn’s disease. Although 
environmental factors, including the gut microbiome, have long been 
suspected to contribute to Crohn’s disease in genetically susceptible 
individuals, formal proof of this concept was lacking, and viruses were 
a previously unsuspected trigger. Another implication of this work is the 
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mechanisms by which defects in autophagy protein function might contribute 
to the development or perpetuation of intestinal inflammation are based on 
studies in vitro and animal models. There is no direct evidence that autophagy 
defects contribute to human Crohn's disease, although mutations in three 
autophagy-related genes, ATGI6L1, NOD2 and IRGM, are known to enhance 
risk of the disease. E, epithelium; IgM, immunoglobulin M; L, lumen; LA, 
lymphoid aggregates; TM, thickened muscle. Scale bar, 200 um. 


concept that autophagy proteins, through their diverse roles in immu- 
nity and the control of inflammation, may serve as a central rheostat 
that prevents inflammatory diseases triggered by environmental stress 
(Fig. 4). 

An important unanswered question is whether perturbations in 
autophagy may also result in inflammatory autoimmune disease. 
Genome-wide association studies have linked several single nucleo- 
tide polymorphisms (SNPs) in ATGS to SLE susceptibility” ’. SLE is 
a multifactorial, heterogeneous disease characterized by autoimmune 
responses against self-antigens generated from dying cells. Although the 
effects of these SNPs on ATGS expression and function are not known, 
the lack of Atg5-dependent negative thymic selection generates auto- 
immunity and multi-organ inflammation in mice”. Loss of other ATG5- 
dependent effects, including regulation of IFN and pro-inflammatory 
cytokine secretion”, clearance of dying cells** and dendritic cell 
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antigen presentation”, might also contribute to the autoimmunity and 
inflammation associated with SLE. Thus, a link between ATG5 muta- 
tion (or mutation of other autophagy genes) and SLE pathogenesis is 
biologically plausible, although not yet proven. 

Defects in autophagy may contribute to inflammation-associated 
metabolic diseases such as diabetes and obesity, which are both linked 
to insulin resistance. The metabolic inflammasome — a complex com- 
posed of signalling molecules such as PKR, eIF2a, JNK, IRS and IKK— 
may act as a link between ER stress and more global stress responses, 
including inflammation and metabolic dysfunction (as observed in 
insulin resistance and obesity)”. Although most components of the met- 
abolic inflammasome promote autophagy, the induction of autophagy 
by this signalling complex would be expected to serve as a negative- 
feedback mechanism that limits ER stress and disease progression. 
Consistent with this postulated protective effect of autophagy, hepatic 
suppression of the autophagy gene Afg7 in mice results in increased ER 
stress and insulin resistance”, and mice deficient in the autophagy adap- 
tor protein p62 develop mature-onset obesity and insulin resistance”. 
Furthermore, obesity is associated with the accumulation and activation 
of macrophages and subsets of T cells in adipose tissue and the produc- 
tion of cytokines such as TNF-a and IL-6 (ref. 97). Thus, the failure of 
autophagy-dependent control of ER stress, immune cell homeostasis, 
immune cell activation and/or pro-inflammatory cytokine secretion 
may contribute to inflammation-associated responses that underlie the 
pathogenesis of metabolic diseases. 

Another potential link between autophagy deficiency and chronic 
inflammation is in cystic fibrosis”, a life-threatening genetic disorder 
caused by mutations in the gene encoding the cystic fibrosis trans- 
membrane conductance regulator (CFTR). Mutations in CFTR lead 
to autophagy inhibition in lung epithelial cells through a mechanism 
that may involve ROS-mediated sequestration of the beclin 1-class III 
PI(3)K complex in perinuclear aggregates (redirecting it from its site of 
autophagy action at the ER). Restoration of beclin 1 and autophagy in 
cystic fibrosis epithelial cells rescues the disease phenotype, and antioxi- 
dants reverse the airway inflammation in a cystic fibrosis mouse model 
bya mechanism postulated to involve autophagy. 


Future directions 

The first series of studies demonstrating that the autophagy machin- 
ery is used to attack invading intracellular bacteria was published in 
2004 (refs 30, 99, 100). Although autophagy had been observed at the 
ultrastructural level in cells infected with intracellular bacteria and 
viruses decades earlier, these studies were a seminal advance. For the 
first time, pharmacological and genetic manipulation of autophagy, 
which built on the discoveries of the yeast screens that identified the 
autophagy machinery, challenged the very notion of autophagy as an 
‘auto’- (self), ‘phagy’ (eating) pathway. Indeed, we learned that the same 
genes that are used to orchestrate the degradation of self-constituents, 
either for nutritional/energy homeostasis or cellular damage control, 
are also used to orchestrate the degradation of foreign invaders, termed 
xenophagy. 

In the past few years, research in the field has uncovered new layers of 
complexity and functional diversity in terms of how this set of genes — 
originally characterized in the context of macroautophagy — may func- 
tion to protect multicellular organisms against not only the threats of 
infection but also the threats of the host’s own response to infection. The 
autophagy machinery does much more than form autophagosomes to 
engulf microbes — it somehow allows microbes in phagosomes and vacu- 
oles to be targeted to the lysosome; it enables crucial cells in the immune 
system to develop properly and perform some of their ‘normal’ functions 
(such as produce IEN, secrete antimicrobial peptides or present antigens to 
stimulate adaptive immunity); and it ensures that these responses do not 
become out of control by functioning in central immunological tolerance 
and the negative regulation of innate and inflammatory signalling. Thus, 
recent advances may not only modify our understanding ofimmunity (in 
terms of understanding new roles of the autophagy machinery in immune 
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regulation) but also reshape our understanding of the pathogenesis of 
inflammatory diseases (in terms of understanding how perturbations in 
autophagy protein function may contribute to such diseases). 

Clearly, our understanding of the molecular mechanisms of the pleth- 
ora of functions of autophagy proteins in immune-related processes is 
still quite primitive. We speculate that, similar to the way in which the 
initial genetic screens in yeast transformed autophagy research, current 
proteomic and genomic screens have the potential to transform research 
on autophagy and immunity. Such a transformation would include facil- 
itating a much deeper understanding of the molecular mechanisms of 
the existing known immunological functions of autophagy through the 
use of the tools of modern systems biology to understand autophagy 
protein-protein interaction and signalling regulatory networks on a 
broad scale. Perhaps more exciting is the possibility that such a trans- 
formation will uncover new ways in which this ancient self-defence 
machinery can function in immunity. = 
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Pervasive roles of microRNAs in 
cardiovascular biology 


Eric M. Small! & Eric N. Olson! 


First recognized as regulators of development in worms and fruitflies, microRNAs are emerging as pivotal modulators 
of mammalian cardiovascular development and disease. Individual microRNAs modulate the expression of collections 
of messenger RNA targets that often have related functions, thereby governing complex biological processes. The wide- 
ranging functions of microRNAs in the cardiovascular system have provided new perspectives on disease mechanisms 
and have revealed intriguing therapeutic targets, as well as diagnostics, for a variety of cardiovascular disorders. 


congenital birth defects and causes of adult morbidity and 

mortality'”. Although the cellular mechanisms and gene 
mutations responsible for numerous cardiovascular disorders have 
been extensively studied, it has become apparent only recently that 
microRNAs (miRNAs) have key roles in cardiovascular development 
and disease**. The prominent functions of miRNAs in cardiovascular 
biology probably reflect the sensitivity of the cardiovascular system 
to relatively subtle perturbations in gene expression, which can result 
in severe and often fatal abnormalities. 

A primary role of miRNAs seems to be the ‘fine-tuning’ of gene 
expression to control development and tissue homeostasis*. However, 
under conditions of stress, the functions of miRNAs become especially 
pronounced, underscoring their roles in disease. Highly specific 
patterns of miRNA expression correlate with different cardiovascular 
disorders (suchas cardiac hypertrophy, heart failure” ’, post-myocardial 
infarction remodelling’** and vascular remodelling’*"*), and gain- and 
loss-of-function miRNA studies in mice have revealed pathogenic and 
protective functions of miRNAs in vivo*”’. Correlation of the cellular 
targets of miRNA action with cardiovascular phenotypes illuminates new 
biological pathways and disease mechanisms. Especially intriguing is the 
ability to manipulate individual miRNAs in vivo using oligonucleotide- 
based inhibitors or miRNA mimics, thereby opening up possibilities for 
the therapeutic manipulation of miRNAs”. 

In this Review, we describe the biology and mechanisms of action of 
miRNAs in the cardiovascular system, and consider the opportunities 
and challenges for the therapeutic modulation of miRNAs in cardio- 
vascular disease. See refs 8, 19 and 20 for more detailed reviews of the 
biosynthesis and mechanisms of action of miRNAs. 


D iseases of the cardiovascular system are the most common 


Functional concepts of miRNA action 

MicroRNAs are ~22-nucleotide single-stranded RNAs that inhibit the 
expression of specific mRNA targets through Watson-Crick base pair- 
ing between the miRNA ‘seed region’ and sequences commonly located 
in the 3’ untranslated regions (UTRs). The human genome is estimated 
to encode up to 1,000 miRNAs”, which are either transcribed as stand- 
alone transcripts, frequently encoding several miRNAs, or generated by 
the processing of introns of protein-coding genes”. The integration of 
miRNAs into introns of protein-coding genes serves to coordinate the 
expression of the miRNA with the mRNA encoded by that gene, without 
the necessity for a separate set of cis-regulatory elements to drive expres- 
sion of the miRNA (Fig. 1a). It is not uncommon for intronic miRNAs to 


modulate the same biological processes as the protein encoded by the host 
gene”, The dual functions of such genes, encoding protein and miRNA, 
provide sophisticated feedback and feedforward regulatory networks, 
specific examples of which are highlighted throughout this Review. 

Genetic deletions of miRNAs in organisms ranging from worms 
to mice have shown that few developmental processes are absolutely 
dependent on single miRNAs*”. A recent study using compound mutant 
worms suggested there was significant redundancy within miRNA 
families, between unrelated miRNAs, and even between miRNAs and 
transcription factors, perhaps evolving as a buffer against deleterious 
variations in gene-expression programs~”. The actions of miRNAs often 
become pronounced under conditions of physiological or pathological 
signalling, suggesting conditional activities of miRNAs that necessitate 
genetic perturbation or sensitizing agents to uncover their functions. 

miRNAs typically exert modest inhibitory effects on many mRNAs, 
which often encode proteins that govern the same biological process 
— for example, the fibrotic response is inhibited by miR-29 (ref. 14), 
cardiac conduction by miR-1 (refs 30-32), actin cytoskeletal dynamics by 
miR-145 (ref. 16), the phosphatidylinositol-3-OH kinase (PI(3)K)-AKT 
pathway by miR-486 (ref. 33) and stem-cell pluripotency by miR-145 
(ref. 34). The cumulative reduction in expression of several components 
of a molecular pathway reduces the importance of a single miRNA- 
mRNA interaction to elicit a biological response, and adds robustness 
to gene-regulatory networks (Fig. 1b). The multiplicity of miRNA targets 
may also promote combinatorial regulation by miRNAs that individually 
target various mRNAs whose protein products contribute to one particu- 
lar regulatory axis (Fig. 1c). In this model, a biological response would be 
expected only after co-expression of several miRNAs that cooperatively 
target various components of a functional network or are all required to 
sufficiently repress a single target. By contrast, some miRNAs seem to 
reinforce an appropriately ‘balanced’ pathway by targeting both positive 
and negative regulatory components (for example, agonism and antago- 
nism of Nodal signalling by miR-430)* (Fig. 1d). This mode of action 
allows buffering against minor physiological variations. Clearly, miRNA 
biology is a complex and highly orchestrated mode of gene regulation, 
potentially impinging on nearly all biological processes in mammals and 
having particularly important roles in disease states. 


Oligonucleotide modulation of miRNA function 

The ability of miRNAs to modulate important biological pathways offers 
opportunities for the manipulation of miRNA function using oligonu- 
cleotide inhibitors (antimiRs) or miRNA mimics (Fig. 2). Antisense 
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Figure 1 | Concepts of miRNA function. The potential modes of miRNA- 
based regulation of gene expression are shown. a, Intronic miRNAs are 
encoded within an intron of a host gene. mRNA splicing generates a protein- 
coding transcript and an miRNA stem-loop. Intronic miRNAs often regulate 
similar processes to that of the protein encoded by the host gene. AAA, 
polyadenylated tail of the transcript; pre-miRNA, precursor miRNA. 

b, Acommon mechanism of miRNA function involves the modest repression 
of several mRNAs in a common biological process by a single miRNA. This 
mechanism reduces the dependence on a single miRNA-mRNA interaction 
and increases the robustness of the gene-regulatory network. TF, transcription 
factor. c, Many miRNAs may cooperatively or redundantly regulate a single 
biological process, by individually targeting many components of that 
process or by synergistically repressing a crucial component of a pathway. 

d, miRNAs may act as a ‘buffer’ against minor perturbations in a biological 
pathway. This is accomplished by the targeting of factors that positively and 
negatively influence a particular process, thereby insulating that process from 
environmental fluctuations. 


oligonucleotides directed against specific miRNA sequences are effi- 
ciently taken up by a variety of tissues and block miRNA function in 
the heart and vasculature”®. 

Other oligonucleotide-based techniques involve ‘target protectors’ or 
‘masks, which block individual miRNAs from binding to their mRNA 
targets, thereby rescuing the mRNA from inhibition. Target protectors 
have been validated in zebrafish® and in cultured cardiac myocytes”. 
miRNA ‘sponges’ or ‘decoys’ containing several miRNA-binding sites 
also act as competitive inhibitors for miRNA binding””*. Although the 
results obtained from pharmacological knockdown or overexpression 
of miRNAs sometimes differ from those obtained using genetic mouse 
models, the development of these various technologies has greatly accel- 
erated the rate at which basic biological questions can be answered in 
an experimental setting. 


miRNAs in cardiovascular development 

The requirement of miRNAs for cardiovascular development and 
function was initially demonstrated by tissue-specific deletion in 
mice of the Dicer gene, which encodes an enzyme that is essential 
for miRNA processing. Lethal phenotypes were observed after Dicer 
deletion in myocardial and vascular lineages””*’. Although these 
findings highlight the crucial roles of miRNAs in the cardiovascular 
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system, no specific miRNA deletion has yet been found to cause 
fully penetrant embryonic lethality in mice, indicating significant 
redundancy of miRNA function” and suggesting that the lethal con- 
sequences of Dicer deletion reflect the collective functions of many 
miRNAs rather than any single miRNA. The rapidly expanding 
number of miRNAs implicated in various aspects of cardiovascular 
biology precludes an in-depth review of all of them, so general prin- 
ciples of miRNA regulation and function are considered throughout 
this Review. 


Roles of miRNAs in heart development 

Heart formation requires precise and complex interactions among 
diverse cell types from several lineages — cardiomyocytes, endocar- 
dial, epicardial and vascular cells, fibroblasts and cells of the conduction 
system. Specific miRNAs are enriched in different cardiac cell types and, 
in some cases, have been found to participate in the specification of 
cell identity. Genetic ablation and antisense oligonucleotide-mediated 
knockdown studies have shown miRNA contributions to developmental 
processes as diverse as embryonic stem (ES)-cell differentiation, cardio- 
myocyte proliferation, contractility, ion-channel regulation and cardiac 
conduction (Fig. 3). 

Expression profiling has shown that the 18 most abundant 
miRNAsin the heart account for more than 90% of all cardiac miRNAs”. 
Because a threshold level of miRNA expression seems to be required for 
the efficient repression of target gene expression (typically >100 copies 
per cell)”, the regulation of heart development may depend on either 
a relatively discrete set of miRNAs or the combinatorial function ofa 
larger array of miRNAs expressed at a low level. So far, a functional role in 
heart development has been ascribed to only the most enriched miRNAs, 
reflecting a dosage requirement, functional redundancy or both. 

miR-1 is the most abundant miRNA in cardiac myocytes, and it was 
the first miRNA implicated in heart development*’. miR-1 and the 
related miRNA miR-133 arise from a common precursor RNA, the 
expression of which in the embryonic heart is mediated by two sepa- 
rate enhancers that are regulated by the transcription factors SRF and 
MEF2 (refs 43, 44), integrating these miRNAs into well-characterized 
transcriptional networks. miR-1 and miR-133 seem to function coop- 
eratively to promote mesoderm differentiation of ES cells and suppress 
endodermal and ectodermal cell fates’. By contrast, they have opposing 
roles later in the cardiac lineage when miR-1 promotes and miR-133 
inhibits cardiomyocyte differentiation”. Neither miR-1 nor miR-133 is 
absolutely required for the specification of cardiac cell fates in vivo, as 
50% of mice lacking either miRNA are viable”“*. This disparity between 
the functions of miRNAs as determined by in vitro assays versus in 
vivo loss-of-function studies is a common theme, and suggests that 
compensatory mechanisms that account for the unexpectedly mild 
phenotypes may be activated in genetic knockout mice. Zebrafish seem 
to be particularly sensitive to miRNA regulation, such that inhibition 
of specific miRNAs evokes more dramatic phenotypes than those 
seen in mutant mice. For example, antisense-mediated knockdown of 
miRNAs in zebrafish has revealed roles for miRNAs in the formation of 
the cardiac chambers and the atrioventricular canal”. 


Roles of miRNAs in vascular and blood development 

The formation and function of the vascular system requires the estab- 
lishment and remodelling of a contiguous series of lumenized tubes 
made of endothelial cells. Concurrent recruitment of vascular smooth 
muscle cells (SMCs) to the endothelial plexus during vessel matura- 
tion imparts the necessary tone and contractility for proper blood flow. 
Numerous miRNAs have been shown to govern these processes during 
vascular development and disease (Fig. 4). 

The endothelial-cell-specific miRNA miR-126 is encoded by an 
intron of the epidermal growth factor-like domain 7 (Egf17) gene, 
which encodes an endothelial-cell-enriched growth factor involved in 
the control of cell migration”. miR-126 is induced by blood flow and 
controls angiogenic sprouting of aortic arch vessels by the stimulation 
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of vascular endothelial growth factor signalling”. Mice lacking miR-126 
are partially viable but have fragile and leaky blood vessels and defects 
in angiogenesis”. Antisense-oligonucleotide-mediated knockdown 
of miR-126 in zebrafish causes complete embryonic lethality owing to 
the loss of vascular integrity and haemorrhaging’. Vascular patterning 
in mouse retinas is also modulated by miR-218, which is encoded by 
an intron of the Slit1 and Slit2 genes and inhibits several components 
of the SLIT-ROBO signalling pathway™. This study is an important 
demonstration of the coordinated regulation ofa biological process by 
an miRNA and its host gene. 

miR-143 and miR-145, encoded by a bicistronic pre-miRNA, are 
expressed specifically in SMCs under the control of SRF and mem- 
bers of the myocardin family of co-activators. These miRNAs target 
numerous regulators of actin signalling, including Rho GTPases, sling- 
shot homologue 2, adducin, cofilin and actin itself’®. miR-145 has been 
reported to be necessary and sufficient for SMC differentiation in vitro”. 
However, mice lacking both miR-143 and miR-145 are viable, suggest- 
ing that further mechanisms modulate their functions in vivo'®°*”. 
miR-145-mutant mice have reduced vascular tone, which contributes 
to areduction in blood pressure’®*”. Vascular SMCs from these mutant 
mice show diminished sensitivity to mechanical injury and an abnor- 
mality in phenotypic switching in response to injury that seems to reflect 
perturbations in actin signalling and SRF activation. Collectively, these 
studies demonstrate that miRNAs function as sensors of mechanical and 
environmental changes, thus linking dynamic physiological processes 
with the regulation of gene expression. 

The differentiation of blood cells is also dependent on miRNA 
activity’. An miRNA expression signature has been described for 
haematopoietic stem-cell progenitors, which show dynamic regula- 
tion during differentiation”. Furthermore, the Ago2- (also known as 
Eif2c2-) null mouse has erythroid lineage defects”, and modulation of 
miRNA expression in erythroid progenitors suggests a role for miR- 
NAs in their differentiation®’™. Indeed, loss-of-function studies in 
mice have also implicated miRNAs, including miR-223 and miR-451, 
in erythroid proliferation and differentiation”. For example, gene 
targeting and pharmacological knockdown of miR-451, which is 
enriched in erythroid cells, results in reduced baseline haematocrit 
levels and impaired erythroid expansion in response to oxidative 
stress®*>°°, Further analysis of miRNAs in circulating cells may reveal 
roles in functions such as oxygen delivery, angiogenesis and the inflam- 
matory process. 


miRNAs in cardiovascular disease 

Heart failure and several cardiovascular diseases are associated with the 
re-expression of the fetal cardiac gene program, which may have causa- 
tive or adaptive roles' and includes a signature pattern of miRNAs’*”. 
Indeed, numerous cardiac-enriched miRNAs show dynamic regulation 
in human heart disease, suggesting their involvement in the regulation 
of cardiovascular disease”’*”, 


Roles of miRNAs in heart disease 

The importance of individual miRNAs in the setting of heart disease 
has been shown by genetic deletion in mice subjected to various car- 
diovascular insults (Fig. 3). miRNAs are implicated in pathologies as 
diverse as arrhythmias (miR-1 (ref. 31), miR-133 (ref. 32) and miR-208a 
(ref. 67)), fibrosis (miR-21 (ref. 68) and miR-29 (ref. 14)), pressure- 
overload-induced remodelling (miR-208 (refs 67, 69) and miR-133 
(ref. 70)), and metabolic disorders (miR-33 (ref. 24)). 

One of the best-characterized examples of stress-dependent gene 
regulation by an miRNA involves a family of miRNAs encoded by 
myosin heavy chain (MHC) genes, referred to as MyomiRs””””". 
This is also an example of intronic miRNAs participating in a process 
related to host gene function. Three members of this miRNA family, 
miR-208a, miR-208b and miR-499, are encoded by the a-MHC (also 
known as Myhé), B-MHC (Myh7) and Myh7b genes, respectively. These 
MyomiRs regulate a collection of transcriptional repressors and 
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Figure 2 | Oligonucleotide manipulation of miRNA function. The 
various methods of artificially modulating miRNA expression or activity 
are shown. Endogenous miRNA (red) binds to complementary sequences in 
the 3’ UTR ofa target gene, resulting in translational repression or mRNA 
degradation. An miRNA mimic (green) consists of an oligonucleotide 
duplex of the miRNA and a passenger strand. The miRNA mimic 

comprises the same nucleotide sequence as an endogenous miRNA, and 

is designed to target the same mRNAs as that miRNA. An antimiR (grey) 

is an oligonucleotide that is complementary to an endogenous miRNA, 
thereby designed to bind and inhibit its function. A target mask (blue) is 

an oligonucleotide designed to bind to a portion of an endogenous miRNA 
target without initiating mRNA degradation or translational inhibition. This 
strategy rescues one particular mRNA from miRNA-mediated repression. 
miRNA sponges consist of an open reading frame (ORF) linked to a 3’ 

UTR that contains several binding sites for a particular miRNA, acting as 
competitive inhibitors for miRNA binding. 


signalling molecules that govern MHC expression, as well as thyroid 
hormone activity and the stress-responsiveness of cardiac muscle cells. 
Deletion of Mir208a in mice abrogates the re-activation of the fetal 
6-MHC gene in response to haemodynamic cardiac stress, and pro- 
tects the heart from pathological remodelling”. The MyomiR family 
thus constitutes an intricate regulatory circuit that controls myosin 
gene expression and cardiac stress responsiveness during adaptation 
to pathological signalling. 

In addition to the miR-208 family, several other miRNAs have been 
implicated as either causative or protective in heart disease. NFATc3 
isa transcriptional mediator of cardiac stress signalling that promotes 
pathological hypertrophy”. NFATc3 was recently shown to induce 
miR-23a expression in cardiomyocytes, and antagomir-based knock- 
down of miR-23a in mice abrogates isoproterenol-induced cardiac 
hypertrophy”*. Conversely, acute knockdown of miR-133 was shown 
to induce pathological cardiac hypertrophy in mice”, suggesting a 
potential cardioprotective role for endogenous miR-133. However, 
these findings contrast with the phenotype of Mir133-null mice, which 
undergo a normal hypertrophic response”, highlighting the differ- 
ence between pharmacological modulation of miRNA expression and 
genetic deletion studies. 

The miR-29 family, which is downregulated after myocardial infarc- 
tion, inhibits the expression of several collagens and extracellular matrix 
proteins, thereby contributing to scar formation and fibrosis”. Similarly, 
the miR-199 family is rapidly downregulated in cardiac myocytes under 
hypoxic conditions, relieving the repression of sirtuin 1 and hypoxia- 
inducible factor 1-a in a model of hypoxia preconditioning”. 

The miRNA that repeatedly shows dynamic regulation after cellular 
stress is miR-21, which was shown to promote cardiac hypertrophy 
and fibrosis in response to pressure overload. Knockdown of miR-21 
with a cholesterol-modified antagomir attenuated cardiac remodelling 
after thoracic aortic constriction®. This response was attributed to the 
derepression of the protein sprouty, which negatively regulates the pro- 
fibrotic extracellular signal-regulated kinase-mitogen-activated protein 
kinase (ERK-MAPK) cascade in cardiac fibroblasts™. Paradoxically, 
however, neither genetic deletion nor tiny locked-nucleic-acid (LNA)- 
mediated knockdown of miR-21 in mice alters fibrosis or hypertrophy 
in response to thoracic aortic constriction or other cardiac stresses”. 
The contrasting conclusions of these studies emphasize the gaps in our 
understanding of the mechanisms of miRNA action and oligonucleo- 
tide-based targeting strategies for their inhibition. 


© 2011 Macmillan Publishers Limited. All rights reserved 


Normal 


Cardiac- 
conduction 
system 


Injury 


miR- pala 133 


JIN 


miR-208a 
mm 


Cell-cycle Pacemaker/ Transcriptional ’ 
“ ion channels regulators 

Cardiac 
7 conduction 


afte 
| 


3 
R <— D5 


Differentiation 


Hypertrophic 


REVIEW 


Figure 3 | Functional role of miRNAs in the 
normal and diseased heart. A normal and 

a hypertrophic heart are shown in schematic 
form, depicting miRNAs that contribute to 
normal function or pathological remodelling. 
The expression of selected miRNAs within the 
heart is shown, along with their corresponding 
functions. All arrows denote the normal action 
of each component or process. miR-1 and 
miR-133 are involved in the development of a 
normal heart (left) by regulating proliferation, 
differentiation and cardiac conduction. For 
example, proliferation is promoted by cell-cycle 
regulators, but miR-1 and miR-133 block these 
regulators, thus blocking proliferation. miR- 
208a also contributes to the regulation of the 
conduction system. After cardiac injury (right), 
various miRNAs contribute to pathological 
remodelling and the progression to heart failure. 
miR-29 and miR-21 block and promote cardiac 
fibrosis, respectively. miR-29 blocks fibrosis by 
inhibiting the expression of ECM components, 
whereas miR-21 promotes fibrosis by stimulating 
mitogen-activated protein kinase (MAPK) 
signalling. miR-208 controls myosin isoform 
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Roles of miRNAs in vascular disease 

The vessel wall is composed of endothelial cells and SMCs that 
must maintain a sealed barrier yet allow the exchange of oxygen 
and nutrients with adjacent tissues. Vessels can respond to injury 
or changes in the environment by undergoing phenotypic changes 
that promote endothelial cell migration or fragility, as well as SMC 
de-differentiation, proliferation and migration. Numerous miRNAs 
show marked alterations in expression during vascular injury and 
disease, and expression signatures have now been correlated with 
pathologies such as ischaemia, tumour angiogenesis, atherosclerosis 
anda proliferative thickening and obstruction of the vessel known 
as restenosis’®. Some miRNAs have been shown to have causal roles 
in these disorders (Fig. 4). 

Angiogenesis is a process of endothelial cell proliferation and vascular 
tube sprouting that is promoted in adulthood by various stimuli, inclu- 
ding tumour growth, retinal damage and ischaemia. miR-21 can influ- 
ence the function and migration of angiogenic progenitor cells during 
coronary artery disease”’. Likewise, ischaemia-induced angiogenesis in 
adult tissues can be promoted or inhibited by antisense oligonucleotides 
directed against miR-92a (ref. 77) or miR-126 (ref. 78), respectively. 
miR-126 may also influence susceptibility to atherosclerosis, through 
the modification of endothelial cell function”. 

Vessel injury, instigated by diverse factors such as atherosclerosis, 
hypertension and damage due to a mechanical stenting, results in 
SMC phenotypic changes indicative of a de-differentiated state. Such 
SMCs become proliferative and migratory, entering the vessel lumen 
and causing restenosis. Recent studies have implicated miRNAs as 
mediators of SMC phenotypic modulation and vessel remodelling. 
The expression of miR-21 and the miR- 143/145 cluster are up- and 
downregulated, respectively, after mechanical injury of large vessels", 
and restoration of miR-21 and miR-145 to normal levels prevents 
restenosis’*"**". 


— Hypertrophy 


block apoptosis, respectively. ECM, extracellular 
matrix; LV, left ventricle; MHC, myosin heavy 
chain; RV, right ventricle. 
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miRNA mutations as the basis of disease 

The pervasive influence of miRNAs on cardiovascular function and 
disease raises questions as to whether polymorphisms in miRNAs 
or their target sequences in mRNA transcripts affect human disease. 
Although mutations within the seed regions of evolutionarily con- 
served miRNAs are not common, single nucleotide polymorphisms 
(SNPs) within miRNA-binding sites in the 3’ UTRs of target mRNAs 
have been observed at a higher frequency*”*’. A notable example is 
in the Texel breed of sheep, which develops extreme skeletal muscle 
hypertrophy owing to an SNP in the 3’ UTR of the mRNA encod- 
ing myostatin, a negative regulator of muscle growth™. This mutation 
creates a binding site for miR-1, resulting in repression of myostatin 
expression and unrestricted muscle growth. SNPs within potential 
miRNA-binding sites have also been identified in mRNAs associated 
with hypertension and cardiovascular disease*’. SNPs in miRNAs or 
their targets that cause significant phenotypes seem unlikely to be a 
widespread occurrence, however, because of the substantial degeneracy 
allowed in miRNA-mRNA interactions and the redundant regulation 
of an individual mRNA by several unrelated miRNAs. It remains to 
be determined whether a causative link can be made between miRNA 
SNPs and human disease. 


Clinical perspectives 

Identifying the signature patterns of miRNAs associated with differ- 
ent cardiovascular disorders has opened up opportunities for miRNA 
diagnostics. miRNA profiling can discriminate between specific forms 
of heart disease'””’, such as dilated cardiomyopathy, ischaemic cardio- 
myopathy and heart failure, and disease-associated miRNA expression 
patterns in failing human hearts can be normalized by the stabilization 
of cardiac output''**. Recently, several miRNAs have been detected in 
plasma and reported to be diagnostic for heart failure and myocardial 
infarction’ *. Whether circulating miRNAs are functionally relevant 
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Figure 4 | Functional role of miRNAs in the vascular system. Blood vessel 
schematic showing the endothelial and smooth muscle layers, red blood 

cells and the proliferating SMCs of a neointimal lesion. The expression of 
select miRNAs is shown, along with their observed functional role. Hypoxia 
results in the activation of miR-210 and miR-92a, which promote and inhibit 
angiogenesis, respectively. miR-126, an endothelial-cell-enriched miRNA 
encoded by an intron of the Egfl7 gene, modulates atherosclerosis and 
angiogenesis by regulating MAPK and PI(3)K signalling. Angiogenesis is also 
regulated by miR-218, which is encoded by an intron of the Slit genes. 
miR-143 and miR-145 are expressed in SMCs and control blood pressure and 
vascular tone, and contribute to vascular remodelling. miR-21 is induced 

in SMCs after vascular injury, and promotes proliferation and neointima 
formation. miR-451 regulates the proliferation and differentiation of 
erythroid cells. 


or are simply released from injured tissues remains to be determined. 
However, the cellular secretion of particular miRNAs by exosomes sug- 
gests specificity in the process of miRNA secretion. 

In contrast to many cellular mediators of disease, which are difficult 
(or impossible) to modulate therapeutically, it is unquestionable that 
drugs can target miRNAs. Thus, the involvement of miRNAs in almost 
every aspect of cardiovascular disease raises exciting possibilities for 
the therapeutic manipulation of miRNA-regulated processes. Thera- 
pies based on antimiRs or miRNA mimics are now being developed 
to repress pathological miRNAs or overexpress protective miRNAs, 
respectively. Indeed, antimiR-based studies demonstrating efficacy 
in non-human primates have already been reported””’ and have been 
advanced to human clinical trials. 

The ability of individual miRNAs to modulate complex disease path- 
ways through the targeting of several components of regulatory net- 
works enables miRNAs to modulate tissue stress responses in a manner 
that is distinct from that of classical drugs. The multiplicity of miRNA 
targets also enables miRNAs to bypass mechanisms that render cells 
or tissues insensitive to certain drugs. For example, cells can develop 
insensitivity to single drugs through rare mutations in drug targets or 
desensitization of cell-surface receptors. Such mechanisms are unlikely 
to diminish sensitivity to miRNA inhibitors, which target several steps 
in a disease pathway. 
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Conversely, the targeting of large collections of mRNAs raises pos- 
sibilities for off-target effects or even opposing effects of miRNAs in 
different tissues. Because the mechanistic basis of miRNA-based thera- 
peutics is not clear, the possibility exists that modulating such a diverse 
set of target mRNAs will affect beneficial processes as well as the patho- 
logical condition. The heart takes up globally administered antimiR 
oligonucleotides less efficiently than the kidneys and liver, and the phar- 
macokinetics of miRNA-based therapies remain a hurdle. This issue 
may necessitate the development of new cardiovascular delivery systems 
for miRNA-based therapeutics, to limit uptake in healthy tissue. These 
methods would not be required for strategies involving knockdown of 
cardiac-specific miRNAs. Conjugation of antimiRs or miRNA mimics 
to homing molecules such as peptides, antibodies or other bioactive 
molecules might enrich uptake in cardiac tissue. This technology has 
not yet been successfully translated to the clinic, however, and other 
methods may improve tissue-specific uptake, such as direct administra- 
tion by cardiac catheterization or a drug-coated stent. 


Looking to the future 

Despite recent advances in identifying miRNA contributions to car- 
diovascular development and disease, as well as in developing miRNA 
diagnostics and miRNA inhibitors, many gaps remain in our knowl- 
edge of miRNA-based regulation of gene expression in the normal 
and diseased heart and cardiovascular system. For example, the many 
potential target mRNAs for each miRNA pose significant challenges 
to the identification of those mRNAs that are relevant to a particular 
miRNA-regulated process. Compounding this difficulty is the apparent 
variability in miRNA function based on physiological context or cell 
type, making it necessary to define the potential disparate functions 
of individual miRNAs in different settings. Another important con- 
sideration is that combinatorial interactions between multiple miR- 
NAs with common or coordinated target mRNAs are likely to have a 
major role in gene regulation and the control of physiological pathways. 
Thus, it will be crucial to identify sets of miRNAs acting cooperatively 
within the cardiovascular system. This information will be particularly 
relevant to the development of miRNA-based therapeutics, as 
cocktails of miRNA inhibitors may prove more efficacious than target- 
ing a single miRNA. 

With the current pace of advancements in deciphering the basic prin- 
ciples of miRNA action in cardiovascular development and disease, we 
foresee new therapeutic applications for the prevention and treatment 
of human pathologies based on miRNA biology in the relatively near 
future. m 
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The Polycomb complex PRC2 and its 


mark in life 


Raphaél Margueron’”” & Danny Reinberg* 


Polycomb group proteins maintain the gene-expression pattern of different cells that is set during early development by 
regulating chromatin structure. In mammals, two main Polycomb group complexes exist — Polycomb repressive complex 1 
(PRC1) and 2 (PRC2). PRC1 compacts chromatin and catalyses the monoubiquitylation of histone H2A. PRC2 also contributes 
to chromatin compaction, and catalyses the methylation of histone H3 at lysine 27. PRC2 is involved in various biological 
processes, including differentiation, maintaining cell identity and proliferation, and stem-cell plasticity. Recent studies 
of PRC2 have expanded our perspectives on its function and regulation, and uncovered a role for non-coding RNA in the 


recruitment of PRC2 to target genes. 


that displayed improper body segmentation’. It was suggested 

that Polycomb encodes a negative regulator of the homeotic genes 
that are required for segmentation’. The Polycomb group (PcG) now 
defines a set of genes characterized by mutations that result in similar 
phenotypes to those of Polycomb. The crucial role of PcG proteins dur- 
ing development is highlighted by early embryonic lethality in mice 
after the deletion of genes encoding some of these proteins (Eed, Ezh2 
(also known as Enx-1), Suz12 and Ring1B (Rnf2)). The antagonistic 
activities of the PcG and the trithorax families of proteins culminate 
in the maintenance, throughout development and adulthood, of the 
appropriate patterns of homeotic gene expression in a spatially defined 
manner’. PcG proteins are found in several families of multiprotein 
complexes, including the Polycomb repressive complexes PRC1 
and PRC2 (Fig. 1). Two other PcG complexes were characterized in 
Drosophila, PHO-repressive complex (PhoRC) and Polycomb repres- 
sive deubiquitinase (PR-DUB), and their components have orthologues 
in mammals; however, the conservation of their functions has not yet 
been addressed**. 

Polycomb-mediated gene silencing is thought to rely mostly on 
the regulation of chromatin structure, in part through post-transla- 
tional modification (PTM) of histones. Hence, the PRC2 complex is 
responsible for the methylation (di- and tri-) of Lys 27 of histone H3 
(H3K27me2/3)*° through its enzymatic subunits EZH1 and EZH2, 
whereas the PRC1 complex monoubiquitylates Lys 119 of histone H2A 
(H2AK119ub) via the ubiquitin ligases RING1A and RINGIB (Fig. 1). 
In addition, some PRC1 complexes can regulate gene expression by 
compacting chromatin in a manner independent of enzymatic activ- 
ity’. The PRC1 component Pc (known as CBX in mammals) binds 
specifically to the product of PRC2 catalysis, H3K27me3, leading to the 
hypothesis that PRC1 functions downstream of PRC2. Although this 
premise is still cited in the literature, its operational status is equivocal 
as there are genes targeted by PRC2 that lack H2AK119ub’ and genes 
targeted by PRCI in the absence of PRC2 (refs 9, 10). Nonetheless, PRC2 
and PRC1 are often both required to maintain gene repression. 

Owing to the pivotal role of PRC2 in the coordination of PcG protein 
function, the still partial characterization of PRC1 and PRC1-like com- 
plexes in mammals, and the existence of up-to-date reviews on PRC1 
(refs 3, 6), this Review focuses primarily on mammalian PRC2. After 


ck: term Polycomb (Pc) initially referred to a Drosophila mutant 


considering PRC2 in terms of evolution, we evaluate the newly appre- 
ciated, variable composition of PRC2 and describe the function of its 
catalytic product and its localization. Finally, we discuss the biological 
roles of PRC2, and propose a model for its recruitment to target genes 
that involves non-coding RNA (ncRNA). 


Evolution of PRC2 

The core PRC2 complex, which is conserved from Drosophila to 
mammals, comprises four components: EZH1/2, SUZ12, EED and 
RbAp46/48 (also known as RBBP7/4). The composition of PRC1 
complexes is more variable, with only two core common components — 
RING1A/B together with BMI1, MEL18 (PCGF2) or NSPC1 (PCGF1)*" 
(Fig. 1). The presence of PRC2 in various unicellular eukaryotes led to 
the suggestion that it existed in the last common unicellular ancestor, 
but it was lost at times during evolution as exemplified by the cases 
of Schizosaccharomyces pombe and Saccharomyces cerevisiae, in which 
PRC2 is absent”. Notably, the PRC2 components, in contrast to those 
of PRC1, underwent little duplication in mammals, with vertebrates 
containing two copies of enhancer of zeste homologue, EZH1 and EZH2 
(ref. 12). Drosophila has two copies of the EED homologue, ESC and 
ESCL. Although ESC and ESCL are interchangeable”’, the same might 
not be true for EZH1 and EZH2, which have different expression pat- 
terns. EZH_1 is present in both dividing and differentiated cells, whereas 
EZH2 is found only in actively dividing cells. Also, PRC2 complexes 
containing EZH1 (PRC2-EZH1) in lieu of EZH2 have low methyl- 
transferase activity compared with PRC2—EZH2 (ref. 14). This indi- 
cates that PRC2-EZH2 establishes cellular H3K27me2/3 levels through 
its EZH2-mediated methyltransferase activity, and that PRC2-EZH1 
restores H3K27me2/3 that could have been lost after histone exchange 
or through demethylase activity. Moreover, PRC2—-EZH1 and -EZH2 
have distinct chromatin-binding properties, as illustrated by the specific 
chromatin-compaction property of PRC2-EZH1 (ref. 14). 

In contrast to mammals, PRC2 evolved towards a greater complex- 
ity in plants, with species such as Arabidopsis thaliana having up to 
12 homologues of PRC2 components’*. A homologue of the mam- 
malian and S. pombe heterochromatin protein 1 (HP1) that binds to 
H3K9me3 also exists in plants and is denoted LHP1. LHP1 binds to 
H3K27me3 and interacts with the RING] homologues AtRINGIA 
and AtRING1B, suggesting the existence of a PRC1-like complex 
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Figure 1 | The Polycomb complexes PRC1 and PRC2. a, Diagrams 
representing the composition of PRC2 and PRC1 are shown. In PRC1, the 
diagrams shown on the left correspond to the classical PRC1 complexes, 
whereas those on the right correspond to the so-called PRC1-like complexes. 
Owing to their homology with the Drosophila PSC protein, we assumed 

that the BMI1-, MEL18- and NSPC1-containing PRC1 complexes could 
compact chromatin. The ‘pocket’ shape of the CBX proteins represents the 
chromodomain that specifically recognized H3K9/27me3. HPH1, 2 and 

3 denote human polyhomeotic homologue 1, 2 and 3. X, Y and Z denote 
various proteins such as SCMH1/2, FBXL10, E2F6 and JARID1D that could 


in plants'’. Whereas EZH1 and EZH2 target the same genes and are 
expected to contribute to the same silencing pathway’®, the plant PRC2 
complexes were reported to have distinct functions”. 

On the basis of these criteria, we speculate that PRC2 evolved froma 
function that was partially redundant with gene silencing through the 
H3K9me3 pathway, gaining a more specific role as multicellular organ- 
isms acquired specific cell lineages. 


PRC2 comprises more than four components 

The first PRC2 purifications led to the identification of the four com- 
ponents that are required for its enzymatic activity in vitro. It was 
recently shown that PRC2 contains several other polypeptides (Fig. 1) 
— AEBP2, PCLs and JARID2 — the functions of which are described 
below. Of note, other proteins transiently interact with PRC2 (for exam- 
ple, DNMTs, HDAC] and SIRT1), but their effect on PRC2 function is 
unclear, and as such, they are not discussed further here. 

AEBP2 is a zinc-finger protein that was identified as part of the 
PRC2 complex. It interacts with several PRC2 components to enhance 
its enzymatic activity’, and co-localizes with PRC2 at some target 
genes'*. AEBP2 was postulated to bind DNA with an apparently relaxed 
specificity”®. 

PCL1, PCL2 and PCL3 (also known as PHF1, MTF2 and PHF19, 
respectively) are the three mammalian orthologues of Drosophila 
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contribute to the formation of PRC1-like complexes, whose exact composition 
is still enigmatic. b, Characterized domains with potential functions are 
indicated for each PRC2 component. In EZH2, box 1 and 2 refer to domains 
based on sequence homology, and the numbers below the scheme indicate 

the percentage similarity between mouse and Drosophila homologues for the 
corresponding domain. CXC, cysteine-rich domain; ncRBD, non-coding- 
RNA-binding domain; SANT, SWI3, ADA2, N-CoR and TFIIIB DNA- 
binding domain; SET, Su(var)3-9, enhancer of zeste, trithorax domain; VEFS, 
conserved among VRN2-EMF2-FIS2-SU(Z)12; WD40, short ~40 amino 
acid motifs. 


Polycomblike (PCL). They share the same protein motifs: a tudor domain, 
two plant homeodomain (PHD) finger proteins, a PCL extended domain 
and a carboxy-terminal domain tail’? (Fig. 1). PCL proteins interact with 
PRC2 through EZH2, and to some extent through SUZ12 and the his- 
tone chaperones RbAp46 and RbAp48 (ref. 20). Genome-wide studies 
showed that PCL2 co-occupied PRC2 target genes”. Various functions 
have been attributed to PCLs, from the regulation of PRC2 enzymatic 
activity” to the gene recruitment of PRC2 (refs 21, 24). Mammalian 
PCLs are expressed in a tissue-specific manner”, and this redundancy 
could explain apparent discrepancies between studies. The phenotypes 
associated with PCL mutation in Drosophila and Xenopus, and the co- 
localization and interaction of PCLs and PRC2, point to PCL proteins 
having a crucial role in PRC2 function. Understanding the underlying 
molecular mechanisms will probably require a detailed understanding 
of how PCLs interact with chromatin. 

JARID2 is the founding member of the Jumonji family of proteins 
that catalyses the demethylation of histone proteins, yet it lacks the 
key residues necessary for cofactor binding and is devoid of enzymatic 
activity. Its deletion in mice results in severe defects in cardiovascular 
and liver development”. The C-terminal half of JARID2 contains some 
conserved regions such as the ARID domain (a potential DNA-binding 
domain), the JmjC and JmjN domains, and a zinc finger (Fig. 1). JARID2 
was identified as a PRC2 component, and biochemical studies have 
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Figure 2 | Chromatin properties at PRC2 target genes in ES cells and 
differentiated somatic cells. Schematic representation of chromatin at 
PcG target genes as a function of ES-cell differentiation. In ES cells, most 
PcG targets are methylated on both H3K4 and H3K27 and co-localize 
with the histone variant H2Az. During differentiation, H2Az is removed, 


demonstrated its interaction with EZH2 (refs 22, 26-29). Genome- 
wide studies showed a large overlap between PRC2 and JARID2 
target genes””°’, and the recruitment of JARID2 and PRC2 seems 
to be partially interdependent. Surprisingly, although PRC2 recruit- 
ment is impaired after JARID2 inactivation, H3K27me3 levels are only 
modestly affected”””’. This observation led to the conclusion that 
JARID2 is an inhibitor of PRC2 enzymatic activity**”’. However, further 
characterization of the catalytic activity of PRC2 indicated that JARID2 
enhances PRC2 activity under defined biochemical conditions (ref. 22 
and J. Son, R.M. and D.R., unpublished data). Furthermore, JARID2 is 
able to bind to DNA with a slight bias towards (G+C)-rich sequences”. 
This observation correlates with the reported sequence composition 
of PRC2 target genes’, and is consistent with a potential function for 
JARID2 in PRC2 recruitment. 

Studies of these three PRC2 components have given rise to apparent 
discrepancies. Considering that the factors are not strictly required for 
PRC2 enzymatic activity in vitro, it is perhaps unsurprising that their 
inactivation would result in milder consequences than the inactivation 
ofa core PRC2 component. Nonetheless, these factors are necessary for 
optimum PRC2 activity, and the regulation of PRC2 recruitment and its 
enzymatic activity are tightly connected. We propose that PRC2 func- 
tions as a holoenzyme (Fig. 1), with the additive contribution of each of 
its components being required for maximum activity. 


H3K27 methylation 

It remains to be determined whether the composition of PRC2 is modi- 
fied as a consequence of, or during the processes of, tumorigenesis, 
development and/or maintenance of adult tissue specificity. Thus far, 
despite the different proteins found associated with the core complex, its 
integrity remains intact such that all PRC2 complexes containing either 
EZH1 or EZH2 catalyse H3K27 methylation. Lysine can be mono-, di- 
or trimethylated, with each methylation level likely to be functionally 
distinct. Methylation of H3K27 is processive (H3K27me3 results from 
monomethylation of H3K27me2), and H3K27me3 is a stable mar a0 
Methylated H3K27 is very abundant, with roughly 50% of the H3 his- 
tone being dimethylated, 15% trimethylated and 15% monomethylated 
in embryonic stem (ES) cells*". Although the Pc component of PRC1 
binds to H3K27me?2 and -me3 through its chromodomain in vitro, it 
seems specific for H3K27me3 in vivo, and H3K27me2 seems to be of 
limited importance for maintenance of gene repression”. We previ- 
ously proposed that H3K27mez2 is an important intermediary PRC2 
product, because it not only constitutes the substrate for subsequent 
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and some bivalent domains are resolved. For example, genes that are 
actively transcribed lose H3K27me3. A substantial proportion of PcG 
targets that retain H3K27me3 but lose H3K4me3 are targeted by 
other silencing pathways such as DNA methylation or H3K9 
trimethylation. 


H3K27me3 formation, but might also prevent H3K27 from being 
acetylated. Acetylated H3K27 is thought to be antagonistic to PcG- 
mediated silencing and is enriched in the absence of PRC2 (ref. 32). 

With the exception of a viral protein, PRC2 has the only enzymatic 
activity found thus far that both di- and trimethylates H3K27. These 
methyl marks are associated with facultative heterochromatin — a 
subdivision of heterochromatin that is regulated in a developmental- 
specific manner”. The monomethylated version of H3K27 is associated 
with constitutive heterochromatin — a more stably silent, gene-poor 
region of chromatin — but its enrichment through the gene body is 
correlated with actively transcribed genes*. Exactly how H3K27mel 
arises is still controversial. In plants, H3K27 is monomethylated by two 
enzymes, ATXR5 and ATXR6, which are distinct from PRC2 and not 
conserved in mammals”’. But H3K27mel in mammals is still detected 
in cells bearing non-functional PRC2 (refs 10, 36). We speculate that 
in mammals, H3K27mel is placed by an enzymatic activity distinct 
from that of PRC2, and that the presence of H3K27mel in actively tran- 
scribed genes could arise from the demethylation of H3K27me2/3 by 
the demethylases UTX or JMJD3 (ref. 37). Whether these demethylases 
can function on H3K27mel in vivo is an open question. 

In general, histone PTMs regulate biological processes either by 
altering chromatin structure (by loosening the DNA-histone inter- 
action) or by contributing to the recruitment of further regulatory 
factors. Thus far, H3K27me3 has been implicated in only the latter 
mechanism of action, suggesting that other factors such as PRC1 are 
required to maintain gene repression. H3K27me3 might also indi- 
rectly regulate transcription by sterically preventing proteins from 
binding to chromatin. Enrichment of H3K27me3 correlates with 
gene silencing”, and this observation is supported by the finding that 
H3K27me3 and H3K36me3, a mark that is linked to transcription 
elongation, have distinct localizations”. RNA polymerase (Pol) II 
that is phosphorylated at Ser 5 of its C-terminal domain is present in 
a substantial fraction of H3K27me3-enriched promoters”, and low 
transcript levels are detected”, leading to the suggestion that RNA 
Pol II could be paused at PcG-targeted genes*’. Indeed, several PcG- 
regulated genes in Drosophila and mammals can recruit the RNA 
Pol II transcription complex to their respective promoters and engage 
in early transcription, yet these polymerases encounter an early block 
to elongation. A recent study indicates that short transcripts that are 
generated after transcription and remain bound to a paused RNA 
Pol I could recruit PRC2 (ref. 42). If confirmed, this suggests that 
PRC2 and H3K27me3 can affect gene expression by controlling an 
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engaged RNA Pol II during promoter escape or elongation, rather than 
by regulating the initiation phase of transcription. A likely possibility 
is that PRC2 can repress transcription by different mechanisms, and 
this may be gene specific. 


Genome-wide localization of PRC2 and H3K27me3 
Several publications have reported the genome-wide localization of 
H3K27me3 in various cell lines and organisms, with some divergent 
results depending on the methodology used and the model analysed. 
A conservative estimation is that PRC2 targets represent at least 10% 
of the genes in ES cells. PRC2 specifically resides at — and targets for 
H3K27me3 deposition — the Hox genes and numerous genes encoding 
other developmental regulators“. Interestingly, in human cancer cells, 
the PRC2 component SUZ12 is mainly enriched at the promoters of genes 
encoding glycoprotein and immunoglobulin-like proteins”. Further 
studies are required to determine whether this is a consequence of 
the genetic and epigenetic alterations of cancer cells or whether it is a 
reflection of the cancer-cell origin. 

In Drosophila, domains enriched in H3K27me3 were found 
to cover large regions of the genome, usually exceeding 10 kilo- 
bases (kb)**”. In mammals, two different types of binding pattern 
have been reported for PRC2 or H3K27me3: some very large domains 
of more than 100 kb such as those containing the Hox loci, and 
some smaller domains covering a few kilobases*'*°*”°°. H3K27me3 
enrichment seems to be centred around the transcription start site of 
promoters, but with a lower intensity at the start site itself*’*’ (Fig. 2). 
Some H3K27me3 is found at intergenic regions**’, and H3K27me3 
is enriched in subtelomeric regions” and in long-terminal repeat 
retrotransposons”. 

To understand how PRC2 can maintain specific gene-expression 
patterns, the overall chromatin structure, in addition to H3K27me3 
patterns, should be considered™. This issue has generated a great deal 
of attention in the context of ES-cell differentiation (Fig. 2). ES cells 
are characterized by a more open and flexible chromatin organization 
anda higher overall rate of transcription, which is thought to be impor- 
tant for pluripotency”. Notably, the H3K4me3 mark, often associated 
with active transcription, was present at most, if not all, PRC2-targeted 
genes in ES cells, forming the ‘bivalent domain’””***!**, Although this 
pattern was initially believed to be ES-cell specific”, bivalent domains 
have been found in differentiated somatic cells, albeit at a lower fre- 
quency’; they were also found in zebrafish” but are rarely detected 
in Drosophila®’. Another histone species with seemingly disparate 
functionality that co-localizes with PRC2 is the histone variant H2Az, 
which is usually associated with active genes (Fig. 2). Indeed, PRC2 
and H2Az co-localize in undifferentiated ES cells, and their recruit- 
ment is interdependent”. The apparent contradiction in the presence 
of either H3K4me3 or H2Az with H3K27me3 at the promoters of silent 
genes in ES cells might reflect the necessary plasticity of these cells, but 
could also result in partial leakiness of gene silencing. That PRC2 and 
H2Az co-localize is consistent with the low levels of DNA methylation 
at PcG target genes in ES cells™*’, given the evolutionarily conserved 
exclusivity shown by H2Az and DNA methylation”. 

After ES-cell differentiation, a substantial fraction of bivalent domains 
that lose H3K4me3 and H2Az do gain DNA methylation”. Notably, 
genes enriched in both H3K27me3 and H3K9me3, another mark associ- 
ated with gene repression, are more abundant in human fetal lung fibro- 
blasts (IMR90) than in human ES cells”. In this same study, the authors 
showed that H3K27me3 domains are more extended in IMR90 cells or 
CD4' T cells than in ES cells, and that H3K27me3 domain expansion 
correlates with more efficient transcriptional silencing”. Altogether, these 
results indicate that somatic cells reinforce gene silencing by increasing 
the length of H3K27me3 domains and, for a fraction of PRC2-targeted 
genes, by complementary silencing pathways (H3K27me3 together with 
H3K9me3 or DNA methylation). Not surprisingly, some pluripotency 
factors, the expression of which could be deleterious in differentiated 
cells, are silenced in this redundant fashion™. 
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Figure 3 | The many interactions of PRC2 with chromatin. Schematic 
representation of the PRC2 holoenzyme at chromatin. Putative interactions with 
either DNA or histones that could explain PRC2 recruitment are highlighted. 


PRC2 recruitment 

Exactly how mammalian PRC2 is recruited to chromatin is not clear. 
In Drosophila, DNA sequences known as Polycomb response ele- 
ments (PRE) are targets for PcG protein recruitment when inserted 
at exogenous loci®®. Genetic experiments led to the identification of 
DNA-binding proteins that are required for PcG binding; however, 
genome-wide analysis showed that any one of these trans-acting factors 
only partially overlaps with PcG target genes. Instead, it is thought that 
a combination of these factors might be responsible for the recruitment 
of PcG proteins. 

In mammals, PRC2-targeted sequences are highly enriched in C+G, 
most of them being classified as CpG islands, but these sequences alone 
do not indicate a consensus response element’. Recently, two publica- 
tions identified a mammalian PRE on the basis of PcG complex recruit- 
ment in Drosophila’*’. Both reports suggested an important role for 
YY1, the mammalian orthologue of the Drosophila PRE DNA-binding 
protein PHO, as previously proposed™. RYBP, a protein that interacts 
with both YY1 and PRC1, was shown to be required for PRC1 and PRC2 
recruitment”. Yet genome-wide analysis in mammals did not showa 
clear overlap between YY 1 and PcG target genes®. Moreover, PRC2 is 
under-represented at YY1 response elements®. Hence, so far, there is 
no strong evidence for the involvement of transcription factors in the 
recruitment of PRC2 in mammals. 

On the other hand, long ncRNAs are becoming recognized as impor- 
tant participants in PRC2 function. In mammals, X-chromosome 
inactivation initiates the expression of a 17-kb ncRNA, XIST, which 
coats the X chromosome in cis. Coating with XIST RNA leads toa 
marked alteration of chromatin structure characterized by a progres- 
sive heterochromatinization. The inactive X chromosome becomes 
methylated at H3K27 in an XIST-dependent manner™. The two long 
stem-loop structures formed by the A repeats present 5’ in the XIST 
RNA interact with PRC2 in vitro”, although further regions of XIST 
are clearly involved because an XIST transcript in which the A repeats 
are deleted can still recruit PRC2 to the XIST RNA-coated X chromo- 
some”. Similarly, the long ncRNA KCNQIOTI can mediate PRC2 
spreading in cis, thereby maintaining the imprinted expression of the 
KCNQ1 domain”. Long ncRNA could also promote PRC2 binding in 
trans as shown for the RNA HOTAIR’™”, the expression of which from 
the HOXC locus is associated with repression of 40 kb of the HOXD 
locus. Such mechanisms could be common to a large fraction of long 
ncRNAs”. In light of these results, ncRNA seems to be a strong candi- 
date for PRC2 recruitment. 

Considering this information, we propose a model in which the 
sum of relatively weak interactions or low energy steps that are estab- 
lished by each of the PRC2 holoenzyme components would function 
together to attain the necessary energy to recruit PRC2 (Fig. 3). This 
model predicts up to four steps, not necessarily consecutive, that result 
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in the successful recruitment of PRC2: (1) the interaction of JARID2 
and AEBP2 with DNA'”; (2) the interaction of the histone chaper- 
ones RbAp46 or 48 with histones H3 or H4 (ref. 74); (3) the interac- 
tion of EED with the product of PRC2 catalysis, H3K27me3 (ref. 75), 
and of PCLs with an unknown histone mark; and (4) the interaction of 
PRC2 components with long ncRNA. The resultant binding specificity 
could then be modulated by the variation in the composition of the 
PRC2 holoenzyme and PTMs of its components. Indeed, EZH2 was 
reported to be phosphorylated at Thr 350 (refs 76, 77), a modification 
that modulates PRC2 recruitment”. Consistent with the hypothesis that 
ncRNA will be a major player in the cell-specific recruitment of PRC2, 
phosphorylation of EZH2(T350) enhances its binding to ncRNA”. The 
large pool of long ncRNA may function, in part, to direct the complex 
to defined target genes. This targeting may not necessarily entail linear 
base pairing with target sequences, but instead the tertiary structure of 
the RNA may be key to specific target gene recognition. In this regard, 
the global contribution of HOTAIR to PRC2 targeting indicates that 
ncRNA may also regulate overall PRC2-binding properties to chroma- 
tin, either directly or by bridging it to other factors”. Hence, ncRNA 
could regulate the affinity of PRC2 to chromatin in a similar manner 
to the recently described case of the PRC1 component CBX7 (ref. 78). 
The chromodomain of CBX7 was reported to bind both H3K27me3 and 
the ncRNA ANRIL (also known as CDKN2B-AS1), and binding to one 
ligand can modulate the affinity for the other in vitro. 

It is not yet clear whether the initial recruitment of PRC2 to a defined 
gene and the maintenance of its recruitment involve the same mecha- 
nisms. The PRC2 component EED can bind H3K27me3, and PRC2 
enzymatic activity is stimulated by the presence of H3K27me3, thus 
generating a positive-feedback loop”. The importance of this mech- 
anism is illustrated by the phenotype of Drosophila expressing EED 
point mutants that prevent its binding to H3K27me3 without altering 
PRC2 complex formation; this phenotype includes a global reduction 
in H3K27me2/3. Furthermore, given that some PcG proteins seem to 
stay bound to chromatin during replication” and that the same applies 
to PRC2 components during mitosis*’, PRC2 occupancy of chromatin 
may not necessitate its active recruitment to defined chromatin loci in 
all cases. 


PRC2 pluripotency and differentiation 

Two straightforward models could explain the maintenance of stem-cell 
pluripotency in the context of PRC2-mediated gene repression. Pluripo- 
tency is either lost after the expression of developmental regulators that 
promote differentiation, or lost when the expression of factors requisite 
for pluripotency are silenced (Fig. 4a). The first hypothesis is in keep- 
ing with the role of PRC2 in maintaining the repression of numerous 
developmental regulators in ES cells. This led to the suggestion that 
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PRC2 is required for the maintenance of pluripotency”. However, later 
studies reported that ES cells in which a PRC2 component is inacti- 
vated could be kept undifferentiated. This finding draws our attention 
to the second model that posits the requisite repression of pluripotency- 
specific factors'****!. Indeed, in mouse ES cells, the inactivation 
of SUZ12, JARID2 or PCL2 was reported to be associated with an 
inefficient silencing of the pluripotency factors NANOG and OCT4 
(also known as POUSF1)2?”” (Fig. 4a). Furthermore, inactivation of the 
EZH2 homologue MES-2 extends the plasticity phase during embryonic 
development in Caenorhabditis elegans. This observation probably 
results from the failure of the MES-2 mutant to repress genes that should 
only be expressed during a defined window of time in early develop- 
ment. Altogether, it seems that the sustained expression of pluripotency 
factors overtakes the aberrant expression of developmental regulators 
in PRC2-deficient ES cells. 

By contrast, when ES cells are induced to differentiate, misregu- 
lation of developmental programs becomes more apparent. Hence, 
although Eed“ ES cells (formally EED(L196P) but referred to as 
Eed for simplicity) were unimpaired in their ability to contribute 
to all tissue lineages in chimaeric embryos”, Suz12“” ES cells fail to 
form a proper endodermal layer*’, and Ezh2 or Eed ES cells dis- 
play a severe defect in mesoendodermal lineage commitment". This 
phenotype is not restricted to the deletion of PRC2 core components, 
because impaired differentiation was also reported in Jarid2" and 
Pcl2-knockdown ES cells”””’. Of note, in contrast to the mild pheno- 
type that results from PRC2 deletion in ES cells, PRC1 inactivation 
(Ringla Ring1b double knockout) leads to a proliferation defect, and 
ES cells cannot be maintained®’. Furthermore, deletion of the PRC1 
component RING 1B in the context of Eed“ ES cells worsens the dif- 
ferentiation defects”’. These results indicate that PRC1 is not just a 
downstream effector of PRC2, but instead has distinct functions, and 
its recruitment is at least partially PRC2 independent. 

On the basis of the example of ES-cell differentiation, we would 
expect PRC2 inactivation to prevent lineage commitment and termi- 
nal differentiation. Although PRC2 defects do prevent adipogenesis 
and lymphopoiesis***, PRC2 inactivation also promotes differentia- 
tion during myogenesis and epidermis formation®*” (Fig. 4b). During 
B-cell maturation, EZH2 is required for V,,J558 gene rearrangement, 
and the transition from pro-B to pre-B cells is altered in its absence™. 
In the case of epidermis, PRC2 inactivation leads to upregulation of 
epidermal genes mediated by the transcription factor AP1. Those genes 
are normally expressed at the late stage of differentiation®’. Consider- 
ing that only a small subset of its target genes is reactivated after PRC2 
inactivation, it is likely that individual functions encoded by these genes 
dictate the global consequences for cell differentiation. 
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Figure 4| PRC2-mediated regulation of pluripotency and differentiation. a, Comparison of expression levels of pluripotency factors and factors that induce 
cell commitment during ES-cell differentiation in wild-type and PRC2-impaired ES cells. b, The consequences of PRC2 inactivation for cell differentiation. For 
simplicity, intermediate differentiation steps are not shown in some cases (dashed arrows). 
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PRC2 and cancer 

The expression of PRC2 components is upregulated in various cancers 
such as melanoma, lymphoma, and breast and prostate cancer. EZH2 
has been reported to be a marker of the aggressive stages of prostate 
and breast malignancies***’, and its overexpression promotes neoplas- 
tic transformation of normal prostatic cells” and hyperplasia in breast 
epithelium®*’. With the exception of the EZH1 homologue, the expres- 
sion of PRC2 components is regulated by the retinoblastoma protein 
(pRB)-E2F transcription factors pathway, and is therefore associated 
with cell proliferation'*””. In addition, several microRNAs control EZH2 
expression, the deregulation of which could contribute to EZH2 over- 
expression in cancer. Deletion of PRC2 components in somatic cells led 
to a marked reduction in cell proliferation”, an effect that was linked 
to the PRC2-dependent regulation of the Ink4/Arflocus (also known 
as the Cdkn2a locus), which encodes the tumour-suppressor proteins 
p 162") p 19“ and pl 5" (refs 87, 93). Given these findings, EZH2 was 
proposed to function as an oncogene”. Recurrent somatic mutations 
that interfere with EZH2 enzymatic activity occur in subtypes of lym- 
phoma™ and myeloid disorder®”’, but further studies are required to 
determine the consequences of these deletions for PRC2 activity. Despite 
the above-mentioned results, EZH2 inactivation does not inhibit cell 
proliferation in all model cell lines for prostate cancer”. 

To understand the role of PRC2 in tumour progression, it may 
be more beneficial to determine whether PRC2 is required for the 
de-differentiation of somatic cells or for the epithelial to mesenchymal 
transition, rather than modulating EZH2 levels to gauge its function as 
a tumour-suppressor protein versus an oncoprotein in a defined cell 
context. Indeed, the apparent outcome in the latter case is probably 
dependent on the genetic and epigenetic alterations that initiate cellular 
transformation. Notably, PRC2 seems to be required for the acquisition 
of pluripotency, as Eed and Suz12~“" ES cells fail to induce the repro- 
gramming of B cells in a heterokaryon assay”. If similar mechanisms 
operate during the reprogramming of somatic cells and during tumour 
progression, we would expect EZH2 inhibition to be a good approach 
towards preventing the transition to advanced stages of cancer. Yet, if the 
carcinogenic process initiates from cancer stem cells, it will be crucial 
to attain a better understanding of how PRC2 modulates proliferation 
and, in particular, why PRC2 deletion inhibits the proliferation of some 
somatic cells but not ES cells. 


Perspective 
The progress made in understanding the role of PcG proteins, and espe- 
cially PRC2, has underscored their versatility. Not only is PRC2 involved 
in the regulation of a broad array of biological processes, but it also 
establishes regulatory cues that are stable and propagated throughout 
development. These cues can be subject to adjustment at every step of 
differentiation or in response to external stimuli. With such a pivotal role 
in maintaining the repression of different sets of genes depending on the 
cell type and the developmental stage, PRC2 must be targeted to chroma- 
tin bya coordinated and intricate process, the steps of which may entail 
specific DNA sequence(s), ncRNAs and the chromatin structure associ- 
ated with its target genes. However, this model relies on hypotheses that 
require validation. Such validation entails clarification of how ncRNA 
can recognize defined genomic locations and the exact mechanism by 
which JARID2 or PCL proteins contribute to PRC2 recruitment. 
Although it has now been clearly established that several compo- 
nents of PRC2 are misregulated in disease, their involvement has not 
been well defined yet. Mouse models that allow genetic manipula- 
tions, in conjunction with direct comparisons at the genome-wide 
level of normal versus pathogenic tissues in defined genetic back- 
grounds, may provide solid resources for pinpointing the parameters 
of PRC2-dictated processes. = 
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Systemic risk in banking ecosystems 


Andrew G. Haldane! & Robert M. May? 


In the run-up to the recent financial crisis, an increasingly elaborate set of financial instruments emerged, intended to 
optimize returns to individual institutions with seemingly minimal risk. Essentially no attention was given to their 
possible effects on the stability of the system as a whole. Drawing analogies with the dynamics of ecological food 
webs and with networks within which infectious diseases spread, we explore the interplay between complexity and 
stability in deliberately simplified models of financial networks. We suggest some policy lessons that can be drawn from 
such models, with the explicit aim of minimizing systemic risk. 


part as ecologists sought a conceptual foundation for their subject. In 

particular, Evelyn Hutchinson’, following Elton’, suggested that 
“oscillations observed in arctic and boreal fauna may be due in part to 
the communities not being sufficiently complex to damp out oscillations”. 
He went on to state, based on a misunderstanding of MacArthur’s’ paper, 
that there was now a “formal proof of the increase in stability of a com- 
munity as the number of links in its food web increases”. 

To the direct contrary, however, a closer examination of model eco- 
systems showed that a random assembly of N species, each of which had 
feedback mechanisms that would ensure the population’s stability were 
it alone, showed a sharp transition from overall stability to instability as 
the number and strength of interactions among species increased. More 
explicitly, for N >> 1 this transition occurs once moe” > 1, where m is the 
average number of links per species, and (+) « their average strength’. 

In ecology this has, since the 1970s, prompted a search for special 
food-web structures that may help reconcile complexity with persistence 
or stability**. Along these lines there is, for example, tentative evidence 
for modularity’ (particularly in plant—pollinator associations, where 
linkages tend to be overdispersed or disassociative), and more generally 
for nested hierarchies in food webs’”. The fact that some features of the 
network structure of interactions (such as predator/prey ratios) inferred 
from the Burgess Shale communities are similar to those in present day 
ones"! reinforces hopes that this is a meaningful area of research. 

In the wake of the global financial crisis that began in 2007, there is 
increasing recognition of the need to address risk at the systemic level, as 
distinct from focusing on individual banks'*’*. This quest to understand 
the network dynamics of what might be called ‘financial ecosystems’ has 
interesting parallels with ecology in the 1970s. Implicit in much eco- 
nomic thinking in general, and financial mathematics in particular, is 
the notion of a ‘general equilibrium’. Elements of this belief underpin, 
for example, the pricing of complex derivatives. But, as shown below, 
deeper analysis of such systems reveals explicit analogies with the con- 
cept that too much complexity implies instability, which was found 
earlier in model ecosystems. 

There are, of course, major differences between ecosystems and 
financial systems. For one thing, today’s ecosystems are the winnowed 
survivors of long-lasting evolutionary processes, whereas the evolution 
of financial systems is a relatively recent phenomenon™. Nor have 
selective pressures been entirely dispassionate, with the hand of govern- 
ment a constant presence shaping financial structures, especially among 
institutions deemed “too big to fail”'*. In financial ecosystems, evolu- 
tionary forces have often been survival of the fattest rather than the 
fittest. 


I n the 1960s, the notion of the ‘balance of nature’ played a significant 


In what follows, we first consider the role of the growth in intrafinancial 
system claims in generating bank failure and instability, focusing on the 
problems inherent in prevailing methods of pricing complex derivatives, or 
arbitrage pricing theory (APT). Second, we sketch various ways in which 
such an initial bank failure, or ‘shock’, may propagate to cause cascades of 
subsequent failure. Third, we outline some tentative policy lessons that 
might be drawn from these deliberately oversimplified models. Last, we ask 
how we might reshape the financial system to realize the economic benefits 
individual banks can deliver, while at the same time paying deliberate and 
explicit attention to their system-wide stability. 


Potential causes of an initial shock 

Events external to the banking system, such as recessions, major wars, civil 
unrest or environmental catastrophes, clearly have the potential to depress 
the value of a bank’s assets so severely that the system fails. Although 
probably exacerbated by such events, including global imbalances (China 
as producer and saver, the United States as consumer and debtor), the 
present crisis seems more akin to self-harm caused by overexuberance 
within the financial sector itself. Perhaps as much as two-thirds of the 
spectacular growth in banks’ balance sheet over recent decades reflected 
increasing claims within the financial system, rather than with non- 
financial agents. One key driver of this explosive intrasystem activity 
came from the growth in derivative markets. 

In 2002, when Warren Buffet first expressed his view that “derivatives are 
financial weapons of mass destruction”’®, markets—although booming— 
seemed remarkably stable. Their subsequent growth, illustrated in Fig. 1, 
has been extraordinary, outpacing the growth in world gross domestic 
product (GDP) by a factor of three. In some derivatives markets, such as 
credit default swaps (CDS), growth has outpaced Moore’s Law. These 
developments contributed significantly towards an unprecedented influx 
of mathematically skilled people (quantitative analysts) into the financial/ 
banking industry. These people produced very sophisticated tech- 
niques (including APT), which seemingly allowed you to put a price on 
future risks, and thus to trade increasingly complex derivative contracts— 
bundles of assets—with risks apparently decreasing as the bundles grew. 

However, recent empirical and theoretical studies have indicated that 
the trading activity associated with derivatives can have significant effects 
on markets'”""’. More specifically, Brock and colleagues” have shown that 
proliferation of hedging instruments can destabilize markets. Building on 
this, Caccioli and colleagues” note that APT makes several conventional 
assumptions upon which everything else depends: “perfect competition, 
market liquidity, no-arbitrage and market completeness”. Crucially, this 
adds up to the implicit assumption that trading activity has no feedback 
on the dynamical behaviour of markets. And indeed, in the APT-fuelled 
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Figure 1 | Notional principal value of outstanding derivative contracts, as 
recorded at year end. These include foreign exchange, interest rates, equities, 
commodities and credit derivatives. Data from UK Department for Business, 
Innovation and Skills, International Monetary Fund and Bank of England 
calculations. 


boom time that preceded the bust, APT seemed to be very successful. In its 
imaginary world, market failures are caused by regulatory carelessness, 
resulting in a focus on creating institutional arrangements that seek to 
guarantee the premises upon which APT is based”. To the contrary, 
Caccioli and colleagues argued”' that APT is not a ‘theory’ in the sense 
habitually used in the sciences, but rather a set of idealized assumptions on 
which financial engineering is based; that is, APT is part of the problem 
itself. 

Caccioli and colleagues”' illustrate their point by exploring the 
dynamical properties of a model that gives a more realistic caricature 
of markets, going beyond the idealized world of APT to include the 
effects of individual trades on prices. Prices now depend on the balance 
between demand and supply. The outcome is that “the road to efficient, 
arbitrage-free, complete markets can be plagued by singularities which 
arise upon increasing financial complexity””’. 

Figure 2 illustrates the main results of the analysis by Caccioli and 
colleagues*’. Here n is essentially a measure of the proliferation of deriva- 
tives or similar financial instruments, and s is the overall average value of 
the supply of any one such derivative/financial instrument. The parameter 
é encodes the risk premium that banks require for trading derivatives”. 

Wesee from Fig. 2 that ifn is less than n* (here n* = 4.14), the average 
supply of derivatives, s, is relatively steady and essentially independent 
of the banks’ risk premium (as measured by ¢). But as market complexity 
increases, so that n approaches n*, there is a sharp singularity at ¢ = 0. 
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Figure 2 | Discontinuous transition to instability of derivatives as 
complexity increases. Average supply of any one derivative, s, at competitive 
equilibrium as a function of the number, n, of different derivatives being traded, 
for various values of banks’ risk premium, ¢. Adapted with permission from ref. 
21. For fuller discussion, see text. 
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For n>n*, the average supply increases with increasing n (that is, 
increasing proliferation of financial instruments) if ¢ > 0. Conversely, 
for ¢ < 0 the supply decreases with increasing complexity once n > n*. It 
is emphasized”’ that such sensitivity in market behaviour in the neigh- 
bourhood of the singularity can easily produce very strong fluctuations— 
either positive or negative—in the volume of trading in derivative 
markets. 

Note that the consequences of this singularity are not easily intuited 
from the competitive equilibrium setting. It seems to us that the basic 
process—in grossly simplified terms—is that once there are enough deri- 
vatives to span the space of available states of nature (the net supply of 
derivatives within the system necessary to meet true hedging demand 
from non-banks), the market is essentially complete in the sense of the 
Arrow-Debreu” model. Once that happens, gross derivatives positions 
within the system are essentially unbounded. So long as there is an 
incentive to supply new instruments—a positive premium to trading— 
banks will continue to expand gross positions, independent of true 
hedging demand from non-banks. Such trades are essentially redundant, 
increasing the dimensionality and complexity of the network at a cost in 
terms of stability, with no welfare gain because market completeness has 
already been achieved. 

Caccioli and colleagues”! also examine a measure of market volatility as 
the risk premium parameter ¢ varies. If they calculate this quantity under 
the approximation that the fluctuations in the values of the individual 
‘supply variables’ (s;; derivatives, etc) are completely uncorrelated, they in 
effect recover the happy world of APT, with no singularities. This 
strongly indicates that the highly important singularities in their accurate 
and self-consistent calculations, with market dynamics included, are 
associated with the supplies of different derivatives being strongly corre- 
lated in this domain, as has found to be the case among derivatives 
markets in practice. 

In summary, Caccioli and colleagues suggest that the idealized 
assumptions upon which recent financial engineering has been based 
can give a misleading account of potential instabilities in markets. They 
also note that these instabilities echo those that can develop in ecosys- 
tems as complexity increases*”*. 


Propagation of shocks within financial systems 


In ecology’s models of food webs, aimed at qualitative understanding of 
their dynamical response to perturbation, the nodes are simply species, 
linked to other nodes/species as prey, predator, competitor or mutualist. 
In epidemiological networks, the nodes are susceptible, infected/infectious 
or recovered/immune individuals linked by sexual or other contacts. But 
in a minimally realistic caricature of financial networks—henceforth 
called banks—the nodes have a more complex structure. 

Following Nier and colleagues” and Gai and Kapadia”®, we define such 
a bank/node as schematically illustrated in Fig. 3. In this deliberately 
oversimplified scheme, a bank’s activities are partitioned among four 
categories. Two represent assets: interbank loans (/;) and external assets 
(e,). The other two represent liabilities: interbank borrowing (b;) and 
deposits (d;). The subscript i labels the specific bank (i = 1, 2, ..., N for 
a total of N banks). Solvency requires that the difference between a bank’s 
assets and its liabilities (the capital reserve or ‘net worth’, labelled y; in 
Fig. 3) be positive. That is, yy = (e+ I) — (d, + b) =0. 

These banks are now assumed to be interlinked in a random, Erdés— 
Rényi network, with any one of the N banks connected to any other as 
lender or borrower, or possibly both, each with probability p. A bank’s 
average number of incoming/borrowing or outgoing/lending links is 
then z= p(N— 1). 

Various further assumptions are now made to carry these Bank of 
England/Federal Reserve Bank of New York models to the point where 
the knock-on effects of a single bank failure can be explored in numerical 
simulations. Much of the essential findings of such studies can be captured, 
and made more transparent, by a ‘mean-field’ approximation in which 
each bank has exactly average behaviour’’. This means all banks are the 
same size (rescaled to 1), every bank is linked to exactly z others, all loans 
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Figure 3 | Schematic model for a node in the interbank network. Adapted 
with permission from ref. 25. 


have the same magnitude, w, as do the capital reserves, y, and the ratios of 
loans to total assets, 0. 

As illustrated in Fig. 3, all these models study the consequences of a 
shock that initially hits a single bank, wiping out a fraction, f, of its 
external assets. If the magnitude of this shock exceeds the capital reserve, 
f(l — 0) > y, the bank fails. This is a deliberate oversimplification, aimed 
at a clearer understanding of how an initial failure can propagate shocks 
throughout the system. 

The most direct effect of such a failure is that its z creditor banks will 
lose part or all of their loans. If such losses exceed y, these banks in turn 
will fail, propagating a third phase of shocks to those remaining, and so 
on. Note, however, that a failing bank’s losses are in effect divided among 
its z creditors, so that each subsequent phase of loan-driven shocks is 
attenuated, approximately by a factor z. 

Figure 4 illustrates one of the tentative messages emerging from this 
toy model, showing regimes of failure in terms of the critical parameter y 
(capital reserves relative to bank size) and @ (interbank activity as a 
fraction of total assets). Within the unhatched triangle (0, 1, f), the 
initially shocked bank fails; in the blue triangle (0, 1, A) a second tranche 
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Figure 4 | Domains of interbank lending. Domains are expressed as a 
fraction of total assets, 0, and capital reserves or net worth, y, which result in the 
propagation of interbank loan shocks. The triangle (1, 0, f) defines the region 
where loss of a fraction f of a bank’s external assets will cause it to fail. The blue 
triangle (1, 0, A) depicts the region in which creditors of the initially failing bank 
will receive phase II shocks which cause them also to fail, and the red area (1, 0, 
B) shows the region in which phase III shocks cause failure. Adapted with 
permission from ref. 27. 
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of z banks go down; in the red triangle (0, 1, B) there is a third phase of 
roughly 2 failures; and so on. Note that, when (as above) the initial 
shock is to external assets, the system’s fragility is maximized (failures 
for relatively large values of y) by 0 having values intermediate between 0 
and 1, which in some ways very roughly corresponds to banks substan- 
tially engaged in both retail and investment (high-street and casino) 
activity. As seen earlier and in Fig. 4, an increase in the system’s con- 
nectivity, z, causes the coloured region of instability to shrink; high 
connectivity distributes, and thereby attenuates risk. On the other hand, 
when later-phase failures do occur, they will then involve more banks. 

A second, and almost surely more important, source of shock pro- 
pagation arises from losses in the value of a bank’s external assets, caused 
by a generalized fall in market prices, a rise in expected defaults or a 
failing bank’s ‘fire sale’ actions. Such market liquidity shocks are con- 
ventionally and sensibly represented by discount factors that, for a given 
asset class, are proportional to the number of failing banks holding the 
asset. This may be generalized to distinguish between strong liquidity 
shocks, associated with discounting specific asset classes, and weak 
liquidity shocks, resulting from the expectation of further defaults or a 
more general loss of confidence”’. In all cases and in sharp contrast to the 
attenuation in interbank loan shocks, liquidity shocks amplify as more 
banks fail. Thus, relatively small initial liquidity shocks have the poten- 
tial to make strong contributions to systemic risk. 

A third mechanism of shock propagation, which has been a marked— 
and in many peoples’ opinion the most important—feature of the recent 
crisis has been the diminished availability of interbank loans, or in the 
jargon of the trade, ‘funding liquidity shocks’. This has often taken the 
form of liquidity hoarding in interbank funding markets. Gai and 
Kapadia* have recently shown how such liquidity hoarding can cascade 
through a banking network, with severe consequences. As one bank calls 
in or shortens the term of its interbank loans, affected banks tend in turn 
to do the same. The result is a liquidity-hoarding shock that is not 
subject to the attenuation characteristic of interbank default shocks. 

All three propagation mechanisms can be drawn together within the 
framework defined by Fig. 3 (see also N. Arinaminpathy, S. Kapadia and 
R.M.M., manuscript in preparation). The model can also be generalized 
to treat banks of varying size, including the extreme but realistic case of a 
few very large all-purpose banks, each connected to many smaller banks; 
interconnectivity within real banking networks is far from random””’, 
with long-tailed degree distributions. It also seems that these networks 
tend to be disassociative rather than proportionately connected: that is, 
big banks are disproportionately linked to smaller ones, and conversely. 
Such a ‘wiring up’ of a network is known, unfortunately, to maximize the 
number of individuals infected by an agent that is transmitted by inter- 
personal contact*’. On the other hand, such disassociative structures are 
likely to support a larger number of coexisting banks (another link 
between ecology and banking*’), and can make the network more robust 
to random losses””**. 

Some of this work, particularly that on liquidity shocks, echoes an 
important insight from pervious work**’° (N. Beale and colleagues, 
manuscript in preparation). This is that excessive homogeneity within 
a financial system—all the banks doing the same thing—can minimize 
risk for each individual bank, but maximize the probability of the entire 
system collapsing. A very simple toy model illustrates this. Suppose you 
have N banks and N distinct, uncorrelated asset classes, each of which 
has some very small probability, ¢, of having its value decline to the 
extent that a bank holding solely that asset would fail. At the inhomo- 
geneous extreme, assume each bank holds the entirety of one of the N 
assets: the probability for any one bank to fail is now ¢, whereas that for 
the system isa vastly smaller ¢. At the opposite, homogeneous extreme, 
assume all banks are identical, each holding 1/N of every one of the N 
assets: the probability for any one bank to fail can now be calculated as 
N’ ¢N/NI, and this is obviously also the probability for all N of these 
banks to fail. This homogeneous, ‘herding behaviour limit clearly makes 
each individual bank safer, but the systemic risk is much larger. More 
realistic versions of this scenario consistently show the same unhappy 
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conclusion. Tentative evidence comes from the fact that the world’s five 
largest banks have shown increasing concentrations of assets over the 
last ten years, in contrast to the top five hedge funds, whose less con- 
centrated systems can give greater scope for diversity. The former are in 
trouble, the latter much less so. 


Implications for public policy 

All the studies described earlier involve numerical simulations, but 
many combine such work with analytic results of the kind exemplified 
by Fig. 4. Such analysis of the dynamics of deliberately oversimplified 
models of financial ecosystems carries potentially far-reaching implica- 
tions for the design and implementation of public policy. These impli- 
cations include the following. 


Setting regulatory capital/liquidity ratios 

The cornerstone of the current international regulatory agenda is the 
setting of higher requirements for banks’ capital and liquid assets. The 
traditional rationale for such requirements is that they reduce idio- 
syncratic risks to the balance sheets of individual banks. An alternative 
and more far-reaching interpretation is that they are a means of 
strengthening the financial system as a whole by limiting the potential 
for network spillovers. With this wider objective, prudential regulation 
is following in the footsteps of ecology, which has increasingly drawn on 
a system-wide perspective when promoting and managing ecosystem 
resilience. 

The systemic rationale for financial regulatory intervention is well illu- 
strated by the dynamic models outlined earlier. Consider banks’ buffers of 
capital or net worth (y). These capital ratios have been in secular decline in 
relation to banks’ total assets for at least the past 150 years in the United 
Kingdom and United States*’. Reversing these trends by setting higher 
required capital ratios strengthens the absorptive capacity of each of the 
nodes in the financial network in response to external shocks. As impor- 
tantly, however, it also lessens the risk of idiosyncratic defaults cascading 
around the system, as illustrated in Fig. 4. 

Broadly, the same arguments apply in the setting of regulatory 
requirements on banks’ liquid assets. These liquidity ratios have also 
been in secular decline in the United Kingdom and United States, for at 
least the past half century. Typically, liquidity requirements are specified 
as a minimum ratio of banks’ liquid assets to their short-term liabilities. 
This liquidity ratio can be seen as a means of short-circuiting the poten- 
tial for systemic liquidity spillovers arising from fire sales on the asset 
side of the balance sheet (liquidity shocks) or liquidity hoarding on the 
liabilities side (liquidity-hoarding shocks). In particular, holdings of 
liquid assets reduce the potential for market liquidity risk to propagate 
around the system, while limits on short-term liabilities reduce the 
spread of funding liquidity risk around the system. 


Setting systemic regulatory requirements 
Looking at financial risk through a network lens indicates a fundament- 
ally different rationale for prudential regulation. It also indicates a quite 
different calibration of such regulation. Prudential regulation has 
become increasingly risk-based with the advent of first Basel I and 
latterly Basel II. But the risk in question to which regulation was then 
calibrated has tended to be institution-specific rather than systemic risk. 

To give an example, as conventionally calibrated, capital regulation 
seeks to equalize failure probabilities across individual institutions to a 
given tolerance threshold—such as a 0.1% probability of failure. 
Approaching this problem from a system-wide angle indicates a rather 
different calibration. Instead, the objective would be to set firms’ capital 
requirements to equalize the marginal cost to the system as a whole of 
their failure. In other words, regulatory requirements would be set 
higher for those banks bringing greatest risk to the system; for example, 
because of their size or connectivity. 

Although new in the context of banking, the essential insight here is 
an old one in the study of epidemiological networks. Anderson and 
May” established the theoretical case for focusing preventative action 
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on ‘super-spreaders’ within the network to limit the potential for sys- 
tem-wide spread. Although initially applied in the study of contagious 
diseases, such as HIV/AIDs, this same insight has since been applied in 
managing the dynamics of the world wide web, power grids and bio- 
logical ecosystems*””. 

If anything, this same logic applies with even greater force in banking. 
There has been a spectacular rise in the size and concentration of the 
financial system over the past two decades, with the rapid emergence of 
‘super-spreader institutions’ too big, connected or important to fail 
(Fig. 5). The collateral damage, to both the real economy and financial 
system, following the failure of Lehman Brothers in October 2008 is 
testimony to the force of such super-spreader dynamics. Protecting 
the financial system from future such events would require the key 
super-spreader nodes to run with higher—potentially much higher— 
buffers of capital and liquid assets, which are then proportional to the 
system-wide risk they contribute. 

A second source of system-wide risk, in addition to super-spreader 
failures, arises from aggregate external events, such as booms and busts 
in the real economy. Indeed, historically this has been the largest single 
source of banking problems. If regulation could be operated counter- 
cyclically, with buffers rising in booms and falling in recessions, this 
would lessen systemic risk from this particular source. Why? Because 
increasing insurance in a boom would increase system-wide resilience 
against the subsequent bust, as well as providing an incentive for banks 
to curb risk-taking during the boom. Operating regulation in this way 
would be a new departure for prudential policy—so-called macro- 
prudential policy—but a potentially important one**”’ from a systemic 
risk perspective. 


Netting and clearing derivatives 

The rapid growth in the size and complexity of the derivatives market 
contributed importantly to the destabilizing dynamics of the system 
under stress during the recent financial crisis. This begs questions about 
the underlying structure and dimensionality of the derivatives market. 
One means of simplifying the complex web of interactions between 
banks in derivatives markets is to centralize the trading and clearing 
of these instruments. For example, central counterparties interpose 
themselves between every bilateral transaction, thereby replacing a 
cat’s-cradle of financial network interactions with a single hub-and- 
spokes configuration. Provided the central counterparty is extremely 
robust—to prevent it becoming a super-spreader itself—the upshot is 
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Figure 5 | Recent rise in the size and concentration of the United States 
financial system. This figure illustrates the marked increase in asset 
concentration within the United States banking system since the Glass-Steagal 
restrictions were revoked in 1999. Red line represents the Gramm-Leach- 
Bliley Act (1999), which revoked Glass-Steagal restrictions. Data include only 
the insured depository subsidiaries of banks to ensure consistency over time; for 
example, non-deposit subsidiaries are not included. Data from the Federal 
Deposit Insurance Corporation. 
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a less complex and lower-risk financial network. Efforts are underway 
internationally to extend the scope and reach of central counterparty 
clearing, in particular to ensure it covers transactions in complex over- 
the-counter derivative instruments, such as CDS (D. Duffie and H. Zhu, 
manuscript in preparation). 

In parallel, there are international efforts to reduce the dimensionality of 
derivatives contracts by eliminating redundant trades and through netting. 
This redundancy might arise either because contracts have been reassigned 
to participants (but the claim not extinguished) or because there are per- 
fectly offsetting bilateral transactions between two parties that can be netted. 
For example, the stock of CDS contracts has already been reduced by 
around $25 trillion since December 2007 as a result of such netting arrange- 
ments. Looking forward, there may be more sophisticated multilateral net- 
ting algorithms that can be used to reduce further derivatives balances. 


Shaping the topology of the financial network 

The analytic model outlined earlier demonstrates that the topology of 
the financial sector’s balance sheet has fundamental implications for the 
state and dynamics of systemic risk. From a public policy perspective, 
two topological features are key’”. 

First, diversity across the financial system. In the run-up to the crisis, 
and in the pursuit of diversification, banks’ balance sheets and risk 
management systems became increasingly homogenous. For example, 
banks became increasingly reliant on wholesale funding on the liabilities 
side of the balance sheet; in structured credit on the assets side of their 
balance sheet; and managed the resulting risks using the same value-at-risk 
models. This desire for diversification was individually rational from a risk 
perspective. But it came at the expense of lower diversity across the system 
as whole, thereby increasing systemic risk. Homogeneity bred fragility (N. 
Beale and colleagues, manuscript in preparation). 

In regulating the financial system, little effort has as yet been put into 
assessing the system-wide characteristics of the network, such as the 
diversity of its aggregate balance sheet and risk management models. 
Even less effort has been put into providing regulatory incentives to 
promote diversity of balance sheet structures, business models and risk 
management systems. In rebuilding and maintaining the financial sys- 
tem, this systemic diversity objective should probably be given much 
greater prominence by the regulatory community. 

Second, modularity within the financial system. The structure of 
many non-financial networks is explicitly and intentionally modular. 
This includes the design of personal computers and the world wide web 
and the management of forests and utility grids. Modular configurations 
prevent contagion infecting the whole network in the event of nodal 
failure. By limiting the potential for cascades, modularity protects the 
systemic resilience of both natural and constructed networks. 

The same principles apply in banking. That is why there is an ongoing 
debate on the merits of splitting banks, either to limit their size (to curtail 
the strength of cascades following failure) or to limit their activities (to 
curtail the potential for cross-contamination within firms). The recently 
proposed Volcker rule in the United States, quarantining risky hedge 
fund, private equity and proprietary trading activity from other areas of 
banking business, is one example of modularity in practice. In the United 
Kingdom, the new government have recently set up a Royal Commission 
to investigate the case for encouraging modularity and diversity in bank- 
ing ecosystems, as a means of buttressing systemic resilience. 

It took a generation for ecological models to adapt. The same is likely 
to be true of banking and finance. 
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Little is known of the genetic architecture of cancer at the subclonal and single-cell level or in the cells responsible for 
cancer clone maintenance and propagation. Here we have examined this issue in childhood acute lymphoblastic leukaemia 
in which the ETV6-RUNX1 gene fusion is an early or initiating genetic lesion followed by a modest number of recurrent or 
‘driver’ copy number alterations. By multiplexing fluorescence in situ hybridization probes for these mutations, up to eight 
genetic abnormalities can be detected in single cells, a genetic signature of subclones identified and a composite picture of 
subclonal architecture and putative ancestral trees assembled. Subclones in acute lymphoblastic leukaemia have variegated 
genetics and complex, nonlinear or branching evolutionary histories. Copy number alterations are independently and 
reiteratively acquired in subclones of individual patients, and in no preferential order. Clonal architecture is dynamic and is 
subject to change in the lead-up to a diagnosis and in relapse. Leukaemia propagating cells, assayed by serial transplantation 
in NOD/SCID IL2Ry™" mice, are also genetically variegated, mirroring subclonal patterns, and vary in competitive 
regenerative capacity in vivo. These data have implications for cancer genomics and for the targeted therapy of cancer. 


Recent genome-wide scrutiny of cancer cells has revealed extraordinary 
complexity, with substantial numbers of both potential ‘driver’ and 
neutral or ‘passenger’ mutations per case’. Informative though these 
screens are, they probably reflect predominant or composite genetic 
landscapes that obscure the existence of subclonal heterogeneity of 
disease’. Intraclonal genetic diversity is a common feature of cancer* 
and is probably, from a Darwinian, natural selection perspective, the 
essential substrate for clonal evolution, disease progression, relapse or 
metastasis. Subclonal genetic complexity might also be an important 
consideration for therapeutic targeting. Furthermore, if a subset of 
‘stem-like’ cancer cells, or, as we refer to, propagating cells, are the basis 
of sustained clonal expansion and disease progression’ then, in principle, 
they should be genetically diverse if selection and passage through evolu- 
tionary bottlenecks is to occur. 

Identifying intraclonal genetic architecture requires genetic 
scrutiny of single cells or clonal foci, and there are limited examples 
of this so far®; nevertheless, they testify to the existence of significant 
heterogeneity. The genetic diversity of cancer propagating cells is, as 
yet, unexplored. We elected to address this issue in lymphoblastic 
leukaemia. The substantial advantage of this cancer, in addition to 
its amenability to single-cell analysis, is that it is minimally deranged 
or unstable, genetically, and the broad, temporal sequence of genetic 
events is known. For the B-cell precursor subset of childhood acute 
lymphoblastic leukaemia (ALL) with ETV6-RUNX1 fusion studied 
here, the latter genetic lesion is predominantly a prenatal and pre- 
sumed initiating event’. It is coupled with a modest number (3-6) of 
recurrent, genomic copy number alterations (CNA)®*. These accrue as 
secondary and, most likely, postnatal lesions? in genes that, predomi- 
nantly, regulate the cell cycle or B-cell differentiation’. 


Subclonal diversity of genotypes in ALL 
We initially selected 60 cases of ETV6-RUNX1-positive ALL and in 
which ETV6 was also deleted (15-85% of cells) as detected by fluorescence 


in situ hybridization (FISH). Of these, 30 were further selected (see 
Supplementary Table 1) that also had (by FISH) deletion of PAX5 
(n = 15) or CDKN2A (also called p16) (n = 12) or deletions of both 
PAX5 and CDKN2A (n = 3) in at least 10% of cells. All 30 cases were 
then scrutinized using a multiplexed combination of distinctive 
fluorochrome-labelled bacterial artificial chromosome (BAC) probes. 
Two-hundred cells with the ETV6-RUNX1 fusion signal (that is, the 
reference founder mutation present in all leukaemic cells) were 
evaluated for each case and each individual cell designed an allele status 
(that is, mono- or bi-allelic deletion) for ETV6 and PAXS5 (three colour) 
or ETV6 and CDKN2A (three colour) or ETV6, PAX5 and CDKN2A 
(four colour). The use of an ETV6-RUNX1 probe also allowed us to 
detect duplication of the fusion gene (in 15 out of 30 cases) or an extra 
copy of chromosome 21q (via RUNX1 signal copy number; in 21 out of 
30 cases). The latter is a common genetic abnormality in ALL and an 
assumed driver event’. Cutoff levels (%) for scoring genetically dis- 
tinctive subclones were determined using normal blood controls and 
varied depending upon probe set combination. A threshold was set at 
2% for cells with a single CNA (in addition to ETV6-RUNX1 fusion) 
and 1% for cells with two or more CNA (see Methods and Sup- 
plementary Table 2). 

Enumeration of CNA in individual cells in reference to ETV6- 
RUNX1 fusion—the universal marker of all the leukaemic cells— 
allowed us to identify distinctive genetic signatures of subclones 
and their relative frequencies. From this we could infer the most likely 
evolutionary or ancestral relationships between the subclones and 
derive a clonal architecture. 

The genetic architectures that were observed were very diverse. The 
simplest of these genetic architectures that we identified (in 6 cases) 
involved two or three subclones that could be aligned in a linear 
sequence (Fig. la); however, these were cases with the lowest 
complement of CNA (only two or three out of the possible total of 
the seven pre-selected) in addition to the ETV6-RUNX1 fusion. All 
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Figure 1 | Examples of subclonal architecture in ALL. a, Apparent linear 
architecture, with three clones (patient no. 13). Representative FISH images on 
the right show examples of each subclone. b, Moderately complex architecture 
with five subclones (patient no. 8). Loss of the untranslocated ETV6 allele 
occurs independently in three separate subclones (boxes). ¢, Complex 
architecture with eight subclones (patient no. 16). PAX5 deletions occur 
independently in two separate subclones (boxes). Arrows indicate probable (or 
most likely) ancestral derivation of subclones; dashed arrows indicate possible 
(or alternative) origins of subclones. F, yellow signal, ETV6-RUNX1 fusion 
gene; 2 RUNX1, two red signals (one large, one small) corresponding to one 
normal RUNX1 allele, and one small remnant generated from disruption of 
RUNX1 allele involved in the gene fusion; ETV6, green signal corresponding to 
the normal (untranslocated) ETV6 allele; PAX5 (or CDKN24A), pink signal. 


other 24 cases had a more marked subclonal heterogeneity with up to 
ten subclones related via a branching ancestral tree (Fig. 1b, c). 
Figure la-c illustrates examples of the clonal architectures observed 
(all other cases are depicted in Supplementary Fig. 2). 
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Inspection of clonal genotypes reveals some previously unrecognized 
features. It is apparent that the common or highly recurrent CNA are 
not acquired in any preferential order, indicating that their potency as 
oncogenic mutations may not be contingent upon (or epistatic to) 
other CNA. Subclones with the highest number of CNA, positioned 
‘terminally’ in the branching architecture, were not necessarily numer- 
ically dominant (for example, all three cases illustrated in Fig. 1). 
Unexpectedly, CNA involving the same gene could be simultaneously 
present in distinct subclones and must therefore arise more than once, 
independently. ETV6 was independently deleted two to three times in 
14 of the 30 cases (see Fig. 1b, c), PAX5 deleted two or three times in 8 
out of 18 cases (see Fig. 1c) and CDKN2A deleted two times in 4 of 15 
cases. This raises interesting mechanistic questions and suggests that 
these lesions are not only selected on the basis of clonal advantage but 
may be targeted for DNA-level breakage. One possible mechanism is 
via off-target effects of RAGS or AID". 


Immunophenotypes and genetic diversity 


There is a spectrum of early B-lineage differentiation-linked immu- 
nophenotypic signatures in ALL” and evidence has been presented 
indicating that cells with several different antibody-defined pheno- 
types may have leukaemia propagating activity in vivo’. We analysed 
the genetic heterogeneity of cells flow sorted on the basis of their 
expression of CD34 (immature lineage marker) or CD20 (more 
mature B lineage marker). Sorted populations had similarly complex 
genetic architectures (Supplementary Fig. 3). 

Cells with the immunophenotype CD34* CD38 /°“CD19* appear, 
so far, to be unique to ALL. We previously found this pro-B/stem 
population, possibly non-activated or quiescent (CD38), to be signifi- 
cantly enriched in ALL propagating cells when assayed in the NOD/ 
SCID strain of mice'’. When purified by cell sorting, these cells (from 
patient no. 7) had similar genetic complexity to the bulk leukaemic 
population (Supplementary Fig. 3). 


Clonal architecture in ALL is dynamic 


These descriptions of subclonal, genetic profiles in ALL are snapshots 
taken at a particular time point, that is, at diagnosis. It is likely that 
subclonal diversity and the relative dominance of subclones varies 
continuously with the development and progression of disease. ALL 
rarely has an identified prodromal phase but occasionally (~2%) 
patients with ALL have an aplastic, pre-leukaemic phase a few months 
before a diagnosis of leukaemia’*. We previously described one such 
patient with ETV6-RUNX1* ALL’. The diagnostic ALL cells had 
ETV6-RUNX1 fusion but no ETV6 deletion. Single nucleotide poly- 
morphism (SNP) array screening revealed multiple deletions includ- 
ing BTG1 and 11q and gain of chromosome X. We compared the 
clonal genotypes of cells from this patient at these two time points, 
spread some 7 months apart, and observed a marked shift in clonal 
architecture (Fig. 2a). The subclones dominating the aplastic, pre- 
malignant phase were relegated to minor, intermediary subclones in 
the overt leukaemic phase, with dominance of progeny clones that 
had homozygously deleted CDKN2A. A second patient with a pro- 
dromal, aplastic phase some 3 months before a diagnosis of ALL also 
showed a shift in subclonal dominance (Supplementary Fig. 4). 
Treatment and subsequent relapse in ALL reorders the spectrum of 
genetic abnormalities detected by single gene probing’? or SNP 
arrays” reflecting the probable selection of distinct subclones as a 
basis of relapse’. For five of the thirty selected patients (numbers 6, 
9, 28, 29 and 30), we had matched diagnosis and relapse cells available 
for multiplexed FISH analysis. Clonal architecture at relapse was dif- 
ferent from that at diagnosis. The comparative genetic profiles of 
subclones allowed us to identify the most likely subclone giving rise 
to the relapse, although this attribution was not unambiguous (Fig. 2b 
and Supplementary Fig. 2). Relapse seems to derive from either major 
or minor clones at diagnosis as previously suggested”’ but with a 
suggestion that more than one subclone might contribute to relapse 


20 JANUARY 2011 | VOL 469 | NATURE | 357 


©2011 Macmillan Publishers Limited. All rights reserved 


ARTICLE 
a cap —7 months ————»| ALL 


1F 
2 RUNX1 
34% 1 ETV6 > 
2 CDKN2A 
2 BTG1 


© ETV6-RUNX1 
@ RUNX1 
@ETV6 

@ CDKN2A 
@BTG1 


b oto 
AE >, 
4% { 2RUNKT \ 1% 


eee Eee 
“2 PAXS 


2cF 
2 RUNX1 
OETVE 
1 PAXS 


1F 
2% 2 RUNX1 
0 ETV6 


1F 
3 RUNX1 
OETVE 
2 PAXS 


1F 
2 RUNX1 
OETVE 
1 PAXS 


12% 9% 


1 PAXS 


1F 
3 RUNX1 
OETVE 
1 PAXS 


7% 


Figure 2 | Changes in clonal architecture in ALL. a, Scoring for ETV6- 
RUNX1 fusion and simultaneous deletion of CDKN2A and BTG1 in blast cells 
at diagnosis of ALL and in bone marrow taken 7 months earlier during a pre- 
leukaemic aplasia phase. During the pre-leukaemic aplasia phase no deletion of 
CDKN2A was present, although several other copy number abnormalities were 
present including 11q, 15q, 5q and BTGI gene deletions as well as gain of Xq 
(ref. 19). At diagnosis of ALL some 7 months later, the predominant clone 
contained homozygous deletion of CDKN2A. Only clones above the cutoffs are 
shown. Representative, four-colour FISH pictures for the dominant clones 
during the aplasia and leukaemic phases are shown at the bottom. b, An 
example of relapse originating from a major clone at diagnosis (patient no. 9) 
(other matched relapse cases for patients 6, 28, 29, 30 are in Supplementary Fig. 
2). R, probable clonal origin of relapse. 


(for example, patients 6 and 30; Supplementary Fig. 2). The data also 
indicate that the dominant subclone in relapse itself continues to 
genetically diversify, in some cases acquiring genetic lesions in the 
same gene (or chromosome region) as observed in primary, diagnostic 
subclones. This, along with previous observations on distinctive ETV6 
deletions in relapse versus diagnosis”, provides further evidence for 
reiterative CNA. The patterns of genetic diversity observed in relapse 
indicate that genetically distinct leukaemic propagating cells can sur- 
vive chemotherapy and provide a reservoir for relapse and further 
diversification. 


Genetic diversity of propagating cells in ALL 

Within the genetic architecture of ALL, it cannot be assumed that all 
identified subclones are self-sustaining and propagated by cells with 
extensive self-renewing capacity®. As in evolutionary speciation, it is 
likely that some branches or subclones are long-lived whereas others are 
dead ends or out-competed. Nevertheless, the architectural patterns 
that we observed suggested the possibility that propagating cells for 
ALL might also have variegated genetics and that this should be demon- 
strable via serial transplantation in immunodeficient mice. We trans- 
planted, intra-tibially, varying numbers (2 x 10°-10°) of unfractionated 


358 | NATURE | VOL 469 | 20 JANUARY 2011 


or immunophenotypically flow-sorted leukaemic cells into pre- 
irradiated NOD/SCID IL2Ry"™" mice. Expanded leukaemic popula- 
tions were re-transplanted into secondary recipient mice as a validation 
of self-renewal capacity. We compared the genetic signatures of the cell 
populations that emerged by successful regeneration in vivo, from first 
and secondary transplants, with those in the original diagnostic sample. 
Mice with regenerated ALL had significant proportions (3.1 to 93.5 av. 
59.4; Supplementary Tables 3 and 4) of human haematopoietic 
(CD45*) cells in the marrow and large, pale spleens (Supplementary 
Fig. 5b, c). Effectively, all (>99%) human CD45°* cells were leukaemic 
with the ETV6-RUNX1 fusion (Supplementary Fig. 5d). Genetic ana- 
lysis was carried out on cells harvested from bone marrow but when 
assessed, spleen provided the same result (Supplementary Table 4). 

Leukaemic regeneration in vivo was observed consistently in both 
unfractionated populations and in fractions defined immunopheno- 
typically as CD34 CD38" "°YCD19* and CD34*CD38*CD19* (Sup- 
plementary Tables 3 and 4). This accords with previous evidence’ that 
propagating cells in B-cell precursor ALL are not restricted to one 
immunophenotypic compartment. 

In all 24 mice with primary or secondary leukaemic regeneration, 
several genetically distinct subclones were present, the patterns of 
which reflected the diversity of subclones identified in the original 
diagnostic sample (Figs 3 and 4 and Supplementary Tables 3 and 4). 
The leukaemic cells regenerated in secondary transplants were com- 
pared to pre-transplant primary cells by high-resolution SNP arrays. 
For patient no. 3, these data confirmed subclonal loss of CDKN2A, 
subclonal gain of chromosome 21 and loss of one copy of ETV6 in 
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Figure 3 | Genetics of cells propagating NOD/SCID IL2Ry""" mice. 
Leukaemic cells from patient no. 3 before injection (a), after primary 
transplantation (b; mouse 1, Supplementary Table 3) and after secondary 
transplantation (c; mouse 2, Supplementary Table 3). Representative FISH 
images are shown of the four subclones (B-E) and the putative pre-leukaemic 
cell (A) at the top in a. Boxed cell, below significance threshold. 
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Figure 4 | Shift in clonal architecture of ALL after in vivo NOD/SCID 
IL2Ry"™" transplantation. Clonal architecture of unsorted cells from patient 
no. 7 before injection (a), after primary transplantation (b; mouse 1, 
Supplementary Table 4) and after secondary transplantation (c; mouse 2, 
Supplementary Table 4). d, Summary of SNP data on diagnostic versus 
secondary transplant ALL cells. The blue lines indicate the mean copy number 
plot of five contiguous SNPs. Middle line, normal diploid copy number. Two 
fractions of DNA have been examined using 500K SNP arrays: diagnostic DNA 
(unsorted cells from patient no. 7 before injection) and DNA from leukaemic 
cells regenerated in mice (mouse 2, Supplementary Table 4). Deletions of 
CDKN2A, BTLA and BTG1 are present in both diagnostic and regenerated 
samples, whereas deletion of ETV6, subclonal gain of the 13q31.3-q34 region 
and gain of chromosome 22 are distinctive between the two samples. Plus and 
minus symbols indicate gains or losses of genetic regions, respectively. Arrows 
highlight CNA. 


both diagnostic and regenerated samples (Supplementary Fig. 6 and 
Supplementary Table 3). 

The genetic profiles of regenerated leukaemias revealed variable 
potency of genetically distinct subclones. For patient no. 3 (Fig. 3 
and Supplementary Table 3), four subclones read-out in primary 
and secondary transplants with three being co-dominant. The putative 
pre-leukaemic clone (with ETV6-RUNX1 fusion only) did not regenerate. 
With unfractionated cells from patient no. 7 (Fig. 4 and Supplementary 
Table 4), four of six subclones regenerated upon secondary transplanta- 
tion. One of these (subclone F in Fig. 4) was dominant despite being a 
minor (11%) subclone in the initial diagnostic sample. SNP arrays were 
used to compare the primary diagnostic sample (patient no. 7) versus 
the regenerated (secondary) transplant leukaemias. These revealed that 
both leukaemic populations had BTLA and BTG1 deletions (Fig. 4d) in 
addition to the CNA in CDKN2A, ETV6 and chromosome 21. The 
regenerated leukaemia had an additional chromosome 22 that appeared 
to be absent from the initial, diagnostic cell population (Fig. 4d). This 
was further investigated by FISH using a chromosome-22-specific BAC 
probe. This confirmed the SNP array data with the majority of cells 
(with three RUNX1 and one ETV6 signals) in the regenerated leukaemia 
having an extra chromosome 22 signal (Supplementary Fig. 7). No cells 
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with an extra chromosome 22 signal were detectable in the initial 
diagnostic sample in accord with the SNP array data. However, because 
clone F was dominant in all 9 of 9 mice transplanted with the same cell 
population (Supplementary Table 4), we assume that a minor subclone 
of clone F with the extra chromosome 22 was present in the diagnostic 
sample but at a<1% frequency. Variable competitive potency of 
subclonal regeneration was also seen in mice injected with immuno- 
logically fractionated cells (Supplementary Table 4 and Supplementary 
Figs 7 and 8). 

These data are indicative of additional genetic complexity of sub- 
clones and their propagating cells. Moreover, they indicate that distinc- 
tive genotypes are associated, functionally, with variable competitive 
regeneration in vivo. 


Discussion 


Cancer developmentat the cellular level is widely regarded as a Darwinian 
evolutionary process involving ‘natural selection’ of genetically variant 
cells in the context of a complex micro-environmental ecology”. 
Mutational and phenotypic diversity between cells is, in principle, fun- 
damental to this process. Moreover, driver mutations can be expected to 
have maximal selective currency when present in cells with self-renewing 
functionality. 

Evidence for intraclonal genetic diversity in cancer has been pro- 
vided by chromosome karyotype”, by genetic analysis of multi-focal 
(but monoclonal) cancers”®, by FISH-based screening of tissue sec- 
tions’? or immuno-selected cells*®, by the molecular probing of 
multiple small biopsies*' or of micro-dissected tissue’ ** and, recently, 
by sector-ploidy profiling’. Small numbers of individual circulating 
tumour cells have also been scrutinized for their divergent genetic 
profiles**’’. These studies collectively testify that contemporaneous 
intraclonal genetic heterogeneity is commonplace and, in some cases 
at least, the degree of clonal diversity is predictive of disease progres- 
sion*'. Most of these data derive from epithelial carcinomas with com- 
plex genetic profiles, coupled, in most cases, to genetic instability. In 
such cases the historical timing and sequence of critical or driver 
mutational events is effectively buried and clonal architecture could 
be extremely complex unless clonal dominance occurs. 

A common assumption for both leukaemias and cancer in general, 
based on the original evolutionary model of ref. 22, is that progression of 
disease and predominant genetic profiles reflect sequentially dominant 
clones and an essentially linear dynamic. Our data (summarized in 
Supplementary Fig. 1) suggest dynamic patterns of subclonal develop- 
ment and ancestral relationships that are nonlinear with a variable 
branching architecture. Patterns of genetic diversity in other 
cancers—assessed by single cell or ploidy sorted cell comparative geno- 
mic hybridization (CGH)****”, oncogenically neutral microsatellite 
markers**“° or deep-sequenced IGH gene rearrangements*'—also indi- 
cate nonlinear, branching clonal trajectories. Collectively, these data 
indicate that cancer has a cellular and genetic architecture reminiscent 
of Darwin’s iconic evolutionary tree (or bush) diagram depicting 
speciation”. 

The extent of genetic variegation in subclones that we detect must 
be a significant underestimate. We screened for a limited number of 
pre-selected CNA, which means that other CNA plus any sequence- 
based driver mutations present will not have been registered. Moreover, 
antecedent or intermediary subclones, below the 1-2% frequency which 
we set as a threshold, were identified with more extensive screening in 
several cases (see Supplementary Fig. 2 patients 2, 5, 15, 18, 20, 6). 
Identifying the full complexity of subclonal architecture and genetic 
diversity in ALL (and other cancers) will ultimately require whole- 
genome analysis at the single cell level. 

Our data provide the first direct evidence for genetic diversity of 
cancer propagating cells within individual patients. The consistent 
patterns of subclonal regeneration in mice (that is, in different mice 
injected with the same cellular inoculum) suggest variable capacity 
intrinsically associated with the distinct genotypes of propagating 
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cells. The competitive potency of particular subclones observed, 
however, may to some extent reflect selective pressures exerted by 
regenerative stress in a murine tissue environment. Natural clonal 
selection in patients might produce different outcomes. 

It will be important to assess if genetic diversity of propagating cells 
holds true for other types of leukaemia and cancer in general. If it 
does, then there would be significant implications for both the cancer- 
stem-cell concept itself and for the therapeutic targeting of such cells. 
The original model of a distinct, hierarchically positioned subpopula- 
tion of cancer stem cells’ has proved contentious in both ALL’* and 
other cancers*’. It has been suggested that the NOD/SCID in vivo 
readout for human cancer stem cell may, at least for some cancers, 
simply register dominant subclones**“*. Or, alternatively, that cancer 
stem cells exist but evolve over time****. We have previously docu- 
mented that ‘pre-leukaemic’ and overt leukaemia propagating cells in 
ETV6-RUNX1-positive ALL, although clonally related by descent, are 
distinctive in IgH rearrangements and phenotype’. Our current data 
fit best with what we refer to as a ‘back to Darwin’ model for cancer 
propagating cells and resultant clonal architecture**. In this, cells with 
self-renewing properties have variegated genotypes providing the 
units of selection in the evolutionary diversification and progression 
of disease. Both sequential and concurrent genotypic variation in 
propagating cells occur in ALL and, we predict, are likely to do so 
in other cancers, providing a rich substrate for progression of disease. 
Although it has yet to be evaluated, it is likely that genetic diversity of 
cancer propagating cells will be associated with both frequency vari- 
ation and diversity of functional properties, for example, differenti- 
ation status, niche occupancy, quiescence and drug or irradiation 
sensitivity. This may help to explain some of the inconsistencies 
and controversies in the cancer-stem-cell field** *”**”. Genetic diversity 
in cancer varies in extent with stage of disease’, probably reflecting 
the impact of intraclonal competition and ecological bottlenecks. Single 
cells might negotiate very stringent bottlenecks but the genetic profiles 
that we observed in relapsed ALL and as recorded in, for example, 
prostate cancer metastases*”*? indicate continued diversification of 
propagating cells and dominant or therapy-resistant subclones. 

This perspective contrasts with the unidimensional or flat (albeit 
very complex) genetic landscapes of cancer implied in portraits derived 
from whole-genome scans. This architectural distinction may be of 
some clinical consequence. Targeted therapy, if directed at mutant 
molecules, may have limited efficacy if the targets themselves are not 
initiating lesions but secondary mutations segregated in subclones, 
even when the latter appear dominant. Genetic variegation of cancer 
propagating cells may represent a significant roadblock to effective 
therapy. 


METHODS SUMMARY 


Archival methanol:acetic-acid-fixed cytogenetic pellets from patients with ETV6- 
RUNX1 fusion-gene-positive ALL were obtained from several UK hospitals, with 
local ethical review committee approval (CCR 2285, Royal Marsden Hospital 
NHS Foundation Trust). The clinical and cytogenetic data on these patients are 
given in Supplementary Table 1. Interphase FISH was performed as previously 
described”. 

In each case, at least 200 nuclei were scored for the presence of the ETV6- 
RUNX1 fusion gene in combination with hemizygous or homozygous deletion of 
ETV6, RUNX1, PAX5, CDKN2A and BTG1 and 11g, as well as gain of RUNX1 and 
duplication of the ETV6-RUNX1 fusion gene. Controls included the scoring of 
residual normal cells within the diagnostic sample and scoring leukaemic cells 
with probes hybridizing to irrelevant oncogenes (BCR, ABL) (see main Methods 
and Supplementary Figs 9 and 10). 

NOD/SCID IL2Ry™" mice that lack any B, T and natural killer cell activity were 
bred and maintained under sterile conditions in accordance with Home Office 
regulations. Transplantation of cells was by intra-tibial injections in 7-14-week- 
old mice after 250cGy irradiation. Peripheral engraftment was assessed at 
9-10 weeks after transplantation and if >2% mice were killed. Further analysis 
included the assessment of bone marrow/spleen engraftment, FISH analysis, his- 
tological analysis and serial transplantation. For serial transplantations, recovered 
bone marrow cells were stained with human CD45 to detect human engraftment. 
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An equivalent of 2 X 10° to 2 X 10° human cells was transplanted by intra-tibial 
injections. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Interphase fluorescence in situ hybridization (FISH). Archival methanol:acetic- 
acid-fixed cytogenetic pellets from patients with ETV6-RUNX1 fusion-gene- 
positive ALL were obtained from several UK hospitals, with local ethical review 
committee approval (CCR 2285, Royal Marsden Hospital NHS Foundation Trust). 
Interphase FISH for the ETV6-RUNX1 fusion gene was performed using a com- 
mercial LSI TEL-AML1 extra signal (ES) probe (Vysis, Abbott Laboratories Ltd) 
according to the manufacturers’ instructions. This probe set contains a 350-kb 
probe for the 5’ end of ETV6 (exons 1-4) and a 500-kb probe covering the entire 
RUNX1 gene. The FISH signal pattern for the ETV6-RUNX1 fusion-gene-positive 
cells using the Vysis probe is two red (one large, one small RUNX1 signals), one 
green (ETV6allele not involved in the translocation), one red/green (yellow) fusion 
signal corresponding to the ETV6-RUNX1 fusion gene. Bacterial artificial chro- 
mosome (BAC) or fosmid probes for the PAX5, CDKN2A, BTG1, TBL1XR1 genes, 
11q and other regions of interest were obtained from the BACPAC Resource 
Centre, Children’s Hospital, Oakland Research Institute (http://bacpac.chori.org). 
These were labelled by nick translation with biotin-16-dUTP or digoxigenin-11- 
dUTP (Roche) and hybridized in combination with the ETV6-RUNX1 ES probe. 
FISH was performed by standard protocols”'? and labelled probes detected with 
streptavidin-Cy5 (bioytinylated probes) and (1) monoclonal anti-digoxigenin 
(Sigma), (2) horse anti-mouse IgG-Texas red (Vector Laboratories) and (3) goat 
anti-horse IgG-Texas Red (Jackson Immunochemicals) (for digoxigenin-labelled 
probes). Fluorescent signals were viewed using an Olympus AX2 fluorescence 
microscope equipped with narrow bandpass filters for DAPI, FITC, Spectrum 
orange, Texas red and Cy5. Images were captured and analysed using a charge- 
coupled device (Photometrics) and SmartCapture 3 software version 3.0.4 (Digital 
Scientific). 

Establishing cutoff levels. In each case, at least 200 nuclei were scored for the 
presence of the ETV6-RUNX1 fusion gene in combination with hemizygous or 
homozygous deletion of ETV6, RUNX1, PAX5, CDKN2A and BTGI and 11q, as 
well as gain of RUNX1 and the ETV6-RUNX1 fusion gene. Diagnostic slides from 
26 cases were assessed for hybridization efficiency by scoring the residual normal 
(ETV6-RUNX1 fusion negative) cells on the same slide (see Supplementary Fig. 9). 
The percentage of these cells with loss of a single CDKN2A or PAXS signal was 
0-3% (mean = 1%). As a further control, we hybridized a subset (n = 11) of 
ETV6-RUNX1 fusion-gene-positive cases with uninvolved oncogene probes 
BCR and ABL (see Supplementary Fig. 10). The percentage of cells with the 
expected normal signal pattern (two red, two green) was 96-99% (mean = 98%). 
Cutoff levels for each probe were established by three-colour FISH (test probe in 
combination with the ETV6-RUNX1 ES probe) using three normal control 
peripheral blood slides per probe. As ETV6 and RUNX1 were scored on each slide, 
the values for these two probes were based on 12 slides in total. We used a 
cutoff = mean + 2X standard deviation. Cutoff levels for each probe (in com- 
bination with the ETV6-RUNX1 fusion) were established using 3-12 normal 
control peripheral blood slides. The cutoff levels for three-colour FISH experi- 
ments are given in Supplementary Table 2. Cutoff levels for four-colour FISH were 
the same as above, except for Texas red probes. The cutoff for loss of one signal 
using a fourth probe detected with Texas red was 6.9%, because of spectral overlap 
between Texas red (used to detect digoxigenin-labelled probes) and Spectrum 
orange (used to label RUNX1 in the commercial ETV6-RUNX1 ES probe). 
However, in most cases we used three-colour FISH as a cross-check to infer 
whether clones below this cutoff were real (Supplementary Fig. 2). Only fusion- 
gene-positive cells that also showed the small extra red signal (generated by dis- 
ruption of RUNX1) were used to calculate the relative frequencies of the various 
subclones. 

Genome mapping analysis. Mapping analysis was performed using 500 ng of 
tumour DNA. DNA was prepared according to manufacturer’s instructions using 
the GeneChip mapping 500K assay protocol for hybridization to GeneChip 
Mapping 250K Nsp and Sty arrays (Affymetrix). Briefly, genomic DNA was 
digested in parallel with restriction endonucleases NspI and Styl, ligated to an 
adaptor, and subjected to polymerase chain reaction (PCR) amplification with 
adaptor-specific primers. The PCR products were digested with DNasel and 
labelled with a biotinylated nucleotide analogue. The labelled DNA fragments 
were hybridized to the microarray, stained by streptavidin-phycoerythrin 


conjugates, and washed using the Affymetrix Fluidics Station 450 then scanned 
with a GeneChip scanner 3000 7G. 

Copy number and LOH analysis. SNP genotypes were obtained using Affymetrix 
GCOS software (version 1.4) to obtain raw feature intensity and Affymetrix 
GTYPE software (version 4.0) using the BRLMM algorithm to derive SNP geno- 
types. The samples were analysed using CNAG 3.0 (http://plaza.umin.ac.jp/ 
genome), comparing tumour sample with unpaired control DNA to determine 
copy number and loss of heterozygosity (LOH) caused by imbalance*’. The posi- 
tion of regions of LOH were identified using the University of California Santa 
Cruz (UCSC) Genome Browser, May 2004 Assembly (http://genome.ucsc.edu/ 
cgi-bin/hgGateway). 

Combined fluorescence immunophenotype and FISH. Bone marrow and 
spleen cells from leukaemic mice were cytospun and used for combined or tri- 
ple-colour immunophenotype/FISH analysis as previously described”. Briefly, 
cells were air dried and fixed in acetone before incubating in primary biotinylated 
mouse anti-human CD45 (Clone F10-89-4) and detecting with Avidin-AMCA 
(Vector Laboratories). After antibody staining, the slides were hybridized with the 
Vysis ETV6-RUNX1 ES fusion gene FISH probe as described above. Cells were 
viewed using a Zeiss Axioskop fluorescence microscope fitted with a dual bandpass 
FITC and rhodamine filter, as well as individual DAPI (for AMCA immunophe- 
notype), FITC, rhodamine and Cy5. Images were captured using a charge-coupled 
device (Photometrics) and fluorescence signals merged and analysed using 
SmartCapture X software version 2.6.2 (Digital Scientific). 

Cell separation, phenotyping and sorting. Total mononuclear cells were iso- 
lated by Ficoll gradient centrifugation and directly cryopreserved in DMSO for 
later use. After thawing dead cells were evaluated and excluded by FACS after 
staining with Hoechst 33258 (Invitrogen). For sorting CD34 and CD20 positive 
and negative subsets, samples were stained with either mouse anti-human CD34 
(IgG,, Dako) or CD20 (IgG, Southern Biotech) followed by anti-mouse IgG 
labelled with Pacific blue (Invitrogen). Nonspecific binding of antibody was 
assessed using mouse IgG, isotype control stained samples (Dako). Sorting was 
performed on a BD FACSAria with analysis by Becton Dickinson FACSDiVa 
software. Samples were gated on forward- and side-scatter plots for mononuclear 
cells and further gated in forward-scatter height versus area to exclude clumped 
cells. Before xeno-transplantation, some cells were stained with anti-CD19 PE 
(BD Pharmingen), CD34 FITC (BD Pharmingen) and CD38 APC (BD- 
Pharmingen). CD34*38~"°“CD19* and pro-B CD34*CD38*CD19* cells were 
purified by flow cytometry (in this case, using MoFlo, Dako). Data acquisition 
and analysis were done with Summit (Dako) software. For multi-colour cell 
sorting ‘fluorescence minus one’ controls were used to determine positive and 
negative staining boundaries”. Human cells regenerating in mice were identified 
by staining with anti-CD45 PeCy7 (BD Pharmingen). 

NOD/SCID mouse transplantation. NOD/SCID IL2Ry"™" mice that lack any B, 
T and natural killer cell activity were bred and maintained at the Weatherall 
Institute of Molecular Medicine animal facility in accordance with Home Office 
regulations. Animals were handled under sterile conditions. Transplantations of 
2 X 10°-10° cells were performed by intra-tibial injections in 7-14-week-old mice. 
Recipients received 250cGy of total body irradiation before cell injection. 
Peripheral engraftment was assessed at 9-10 weeks after transplantation and if 
peripheral engraftment was >2% mice were killed. Further analysis included the 
assessment of bone marrow/spleen engraftment, FISH analysis, histological ana- 
lysis and serial transplantation. For serial transplantations recovered bone marrow 
cells were stained with human CD45 to detect human engraftment. An equivalent 
of 2 X 10° to 2 X 10° human cells was transplanted by intra-tibial injections. 
May-Griinwald Giemsa staining. The histological analysis of patient samples 
and mouse bone marrow was performed by May-Griinwald Giemsa staining of 
bone marrow smears, bone marrow cytospin preparations and spleen swabs. 
Slides were analysed on an Olympus BX60 microscope. 
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Evolution of human BCR-ABL1 
lymphoblastic leukaemia- initiating cells 


Faiyaz Notta’?*, Charles G. Mullighan**, Jean C. Y. Wang", Armando Poeppl’, Sergei Doulatov'?, Letha A. Phillips®, Jing Ma’, 
Mark D. Minden*, James R. Downing? & John E. Dick? 


Many tumours are composed of genetically diverse cells; however, little is known about how diversity evolves or the 
impact that diversity has on functional properties. Here, using xenografting and DNA copy number alteration (CNA) 
profiling of human BCR-ABLI lymphoblastic leukaemia, we demonstrate that genetic diversity occurs in functionally 
defined leukaemia-initiating cells and that many diagnostic patient samples contain multiple genetically distinct 
leukaemia-initiating cell subclones. Reconstructing the subclonal genetic ancestry of several samples by CNA 
profiling demonstrated a branching multi-clonal evolution model of leukaemogenesis, rather than linear succession. 
For some patient samples, the predominant diagnostic clone repopulated xenografts, whereas in others it was 
outcompeted by minor subclones. Reconstitution with the predominant diagnosis clone was associated with more 
aggressive growth properties in xenografts, deletion of CDKN2A and CDKN2B, and a trend towards poorer patient 
outcome. Our findings link clonal diversity with leukaemia-initiating-cell function and underscore the importance of 


developing therapies that eradicate all intratumoral subclones. 


A widely accepted tenet of cancer biology is that most tumours arise 
from single cells and that multiple genetic alterations accumulate over 
time, resulting in transformation’. Historically, this process was con- 
sidered to be a stepwise acquisition of new mutations, some of which 
provide a competitive growth advantage, resulting in successive rounds 
of clonal expansion with concomitant loss of earlier, less-fit clones”. In 
this model of tumour evolution, all clones are linearly related to each 
other. However, new genomic technologies are revealing a more com- 
plex clonal architecture in some cancers**. Analysis of chromosomal 
translocation breakpoints and CNA profiling in twins with ETV6- 
RUNX1-positive acute lymphoblastic leukaemia (ALL) showed that a 
pre-leukaemic clone is initiated in utero that expands, seeds both twins, 
and then evolves with different kinetics and CNA acquisition in each 
twin’*®. Genome-wide CNA profiling of paired diagnostic and relapse 
ALL samples has been particularly informative’. In most cases, the 
relapse clone shared only limited genetic identity with the predomi- 
nant diagnostic clone and did not evolve from it. In the rest, the relapse 
clone was either identical or a direct evolutionary product of the dia- 
gnostic clone. These studies predicted the existence of an ancestral, 
pre-diagnostic clone that gave rise to at least two clonal lineages that 
evolved independently in many patients with ALL, with each clone 
acquiring different genetic aberrations: one clone giving rise to the 
dominant diagnostic clone and the other emerging as the predominant 
clone at relapse with the acquisition of additional CNA. These results 
indicate that tumour evolution may occur through a more complex 
branching model that gives rise to genetically distinct subclones at 
diagnosis that vary in aggressiveness and response to therapy’’. 
However, proof of this model requires studies directly examining the 
functional properties of the cells in which genetic changes are found. 
Indeed, within leukaemias, non-proliferating or pre-apoptotic cells are 
often present and thus incapable of contributing to long-term clonal 
maintenance or to relapse. Therefore, functional studies of the cells 


responsible for driving leukaemic growth in patients must be com- 
bined with genetic approaches to identify genetically diverse subclones 
and their evolutionary ancestral precursors, and to show whether they 
possess biologically distinct growth properties. 

Arguably the most important biological function a cancer cell can 
have is the ability to sustain clonal growth”, a property best measured 
by tumour initiation assays in primary and secondary recipients. 
Indeed, some highly aggressive or metastatic tumours of mice and 
humans seem to be functionally homogeneous because almost every 
cell has tumour-initiating-cell capacity. However, most tumours 
appear to be functionally heterogeneous, as the tumour-initiating cells 
or leukaemia-initiating cells typically represent a minor fraction, 
although their frequency varies widely in syngeneic’* or xenograft’ 
recipients. It has been widely considered that intratumoral functional 
heterogeneity results from stochastic processes that influence cell 
growth but also from the variable behaviour of genetic subclones that 
arise through clonal evolution. The cancer stem cell model proposes 
an alternative explanation based on the hierarchical organization of 
the tumour clone where cancer stem cells are solely responsible for 
driving clonal growth and for therapeutic resistance. In the cancer 
stem cell model, tumour-initiating cells and cancer stem cells are 
synonymous and have the properties of self-renewal and maturation 
that are canonical to all stem cells. Epigenetic or developmental pro- 
grams contribute to functional differences between cancer stem cells 
and non-cancer stem cells within a tumour clone that the model 
assumes would be genetically identical’*’*. The cancer stem cell and 
clonal evolution models are the subject of intense debate and often 
considered to be mutually exclusive’®'”. The cancer stem cell model 
focuses on the concept of functional heterogeneity but does not take 
into account tumour evolution, intratumoral genetic variation, or the 
existence of genetically distinct subclones. On the other hand, the clonal 
evolution model focuses on genetic heterogeneity without considering 
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the functional variation that might exist intratumorally within indi- 
vidual genetic subclones. As a first step to resolve the basis for intra- 
tumoral heterogeneity, functional assays must be combined with genetic 
analysis to determine whether tumours contain genetically distinct sub- 
clones of tumour-initiating cells. 

Philadelphia chromosome acutelymphoblasticleukaemia (Philadelphia- 
positive (Ph*) ALL) is an ideal disease in which to study the relationship 
between intratumoral clonal diversity, genetic alterations and cellular 
growth properties because it is considered a single clinical entity with 
identifiable and recurrent genetic abnormalities. Detailed studies have 
revealed a number of genetic alterations, notably deletions of the 
lymphoid transcriptional regulator IKAROS (also called IKZF1), 
PAX5, EBF1, as well as deletions involving CDKN2A/B that cooperate 
with BCR-ABL1 in lymphoid leukaemogenesis’*. Furthermore, the 
association of IKZFI deletion in Ph-negative ALL with poor patient 
outcome predicts that it will be possible to link specific genetic altera- 
tions with function’’. Here we report development of a robust Ph* ALL 
xenograft system that was used to carry out a combined genetic and 
functional study of the genetic diversity of functionally defined tumour- 
initiating cells derived from diagnostic patient samples. 


Modelling human Ph* ALL in xenografts 


To determine whether a single genetic subtype of leukaemia like Ph* 
ALL exhibits uniform growth properties, we used three xenograft 
models of increasing immune deficiency: NOD.CB17-Prkdc““/J 
(NOD/SCID) mice; NOD/SCID mice treated with anti-CD122 to 
deplete innate immune cells (NS122)*°”'; or NOD/SCID mice with 
deletion of the common gamma (y)-chain (NSG)” (Supplementary 
Fig. 1). Diagnosis samples from 18 of 20 Ph* ALL patients efficiently 
engrafted NS122 mice (Supplementary Fig. 2) and recapitulated 
numerous aspects of the human disease including tumour dissemina- 
tion, immunophenotype (Supplementary Fig. 3) and morphology 
(Supplementary Fig. 4). However, 10 of 20 patient samples caused 
clinically manifest disease before 15 weeks and were categorized as 
aggressive group 1 samples, whereas the remaining xenograft mice 
appeared healthy until they were killed and these were classified as 
non-aggressive group 2 samples (Fig. 1a). Accordingly, the leukaemic 
burden in bone marrow and systemic dissemination was significantly 
higher in group 1 versus group 2 samples (Fig. 1b, c, Supplementary 
Fig. 2b and Supplementary Table 1). Notably, group 1 samples 
engrafted all recipient types (Fig. 1d and Supplementary Fig. 5) even 
when transplanted at near-limiting dose (Supplementary Fig. 6). By 
contrast, group 2 samples failed to engraft to NOD/SCID mice 
(Fig. 1d and Supplementary Fig. 5) despite injection of 50-fold more 
cells (Supplementary Fig. 7). Of two group 2 samples (patient 2-5 and 
2-6) unable to engraft NS122 recipients, one engrafted NSG mice 
(Supplementary Fig. 8). The extent of leukaemic dissemination was 
similar in NS122 or NSG mice for both group 1 and group 2 samples, 
although NSG mice had higher peripheral engraftment levels (Sup- 
plementary Fig. 9). 


Genetic basis of functional heterogeneity 


To examine a possible genetic basis for the distinct xenograft growth 
properties between group 1 and group 2 samples, genome-wide CNA 
profiling was undertaken. Overall, the frequency of genetic alterations 
in IKZF1 (84%), CDKN2A/B (50%) and PAX5 (50%) of our 20 samples 
paralleled previous studies’*. A similar frequency of group 1 and group 
2 samples had focal and complete deletions of the IKZF1 locus. 
However, there were marked differences in the proportion with 
deletions of CDKN2A/B (group 1, 90%; group 2, 0%; P = 0.0001) 
and PAX5 (group 1, 60%; group 2, 10%; P = 0.057) genes (Fig. 2e 
and Supplementary Table 2). Genomic quantitative polymerase 
chain reaction (qPCR) confirmed that the CDKN2A/B locus was not 
deleted in representative group 2 samples (Supplementary Table 3) nor 
hypermethylated in an independent cohort of Ph* ALL patients 
(Supplementary Fig. 10). 
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Because previous studies have reported a positive association 
between the efficiency of xenograft engraftment and clinical outcome 
in acute myeloid leukaemia (AML) patients”’, we compared the survival 
of group 1 and group 2 samples. We found a trend towards poorer 
outcome of group 1 patients with increased early relapse, although 
significance was not reached owing to the small sample number 
(Fig. 2f, P= 0.08). Limiting dilution analyses (LDA) of 11 patient 
samples showed that the leukaemia-initiating-cell frequency in group 
1 samples was 80-fold higher than group 2 samples (Fig. 2g). The 
leukaemia-initiating-cell frequency in one group 1 patient was 11%, 
similar to the leukaemia-initiating-cell frequency previously observed 
in comparable murine models™. Although comparison of absolute 
leukaemia-initiating-cell frequency in xenografts and syngeneic recipi- 
ents is subject to some uncertainty”, the relative difference between 
group 1 and group 2 samples is consistent with their distinct growth 
properties. Moreover, preliminary evidence indicates that leukaemic 
evolution, defined by increasingly aggressive and less restrictive xeno- 
graft growth upon serial passage, also correlates with reduced func- 
tional heterogeneity as reflected by increased leukaemia-initiating-cell 
frequency (Supplementary Fig. 11 and Supplementary Table 4). 
Collectively, we provide the first data linking engraftment properties 
and leukaemia-initiating-cell frequency to both specific genetic events 
in cancer and clinical outcome of patients. 


Clonal dynamics of Ph* ALL pathogenesis 

Despite widespread use of tumour xenografts, there are few studies 
comparing genetic alterations in primary samples versus xenografts. 
To determine whether genetic abnormalities of the diagnostic sample 
are propagated upon transplantation, we tracked the clonal dynamics 
of ALL growth in xenografts by comparing CNA profiles of 12 dia- 
gnosis samples (eight group 1 and four group 2 tumours) with paired 
primary and secondary xenografts (Fig. 2a). Overall, xenografts did 
not select for a specific genotype as no single lesion was acquired by all 
tumours (Supplementary Fig. 12 and Supplementary Table 5). In six 
samples, five of which were group 1, xenografts exhibited the same 
distribution of CNA as the diagnostic patient sample, and detailed 
analysis of the antigen receptor (AgR) loci confirmed that the pre- 
dominant clone present at diagnosis was propagated in xenografts 
(Supplementary Fig. 13). 

By contrast, multiple xenografts derived from the six other patient 
samples (three group 1 and three group 2) harboured distinct genetic 
changes compared to the predominant diagnostic clone (Fig. 2b, c and 
Supplementary Fig. 14), while also sharing major CNA such as IKZF1 
and CDKN2A/B (data not shown). The presence of multiple xenograft 
recipients from the same patient sample with both identical and new 
CNA strongly indicates the existence of subclones, present at low levels 
in the diagnostic sample, that harbour additional genetic alterations. 
Although these CNA are newly emergent, we could detect them in other 
diagnostic samples, pointing to their clinical relevance (Supplementary 
Fig. 12). Our findings mirror recent CNA analysis of matched diagnosis 
and relapse ALL patient samples where the majority of relapsed cases 
represent the evolution of a new clone that is related to, but distinct 
from, the predominant diagnostic clone". Notably, the clinical outcome 
of patients in whom the predominant clone at diagnosis was propagated 
in xenografts (CNA concordant) was different from patients where 
xenografts were engrafted with a minor subclone that outcompeted 
the predominant clone (CNA discordant). CNA concordant samples 
had poorer outcome even within our very small cohort (Fig. 2d). These 
data point to the need for larger validation studies and they indicate that 
genetic alterations, which lead to clonal predominance at diagnosis and 
competitive growth advantage in xenografts, are linked to poor survival. 

To determine whether the detection of minor subclones might be 
hindered by out-competition in xenografts of dominant or aggressive 
clones, we analysed the CNA profiles of two patient samples trans- 
planted at limiting (to engraft single leukaemia-initiating cells) and at 
non-limiting cell doses. In patient 1-8, non-limiting (bulk) cell doses 
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Figure 1 | Functional and genetic analysis of Pht ALL. a, Survival of NS122 
mice transplanted with 20 diagnostic Ph* ALL samples. Xenografts moribund 
before 15 weeks were categorized as group 1 (10 patient samples; n = 90) and the 
remainder that appeared healthy until they were killed for analysis were group 2 
(10 patient samples; n = 45). b, Leukaemic burden in two representative patient 
samples, 1-2 and 2-8, determined by flow cytometry of the injected femur (IF), 
bone marrow (BM, left femur/tibiae), spleen (SP) and peripheral blood (PB) 
(n = 4 mice per sample; error bars, mean + s.e.m.). ¢, Cumulative leukaemia 
engraftment in xenografts from panel a transplanted with group 1 and group 2 
samples (error bars, mean + s.e.m.; ***P < 0.0001). d, Comparison of 
leukaemic engraftment of group 1 (n = 3) and group 2 (n = 3) samples in NOD/ 


generated xenografts that differed at three major CNA compared to 
the patient sample (chromosome (Chr) 1 gain, Chr 8p deletion and 
Chr 8q gain) (Fig. 2e). Xenografts derived from limiting cell doses 
(clonal) differed from both non-limiting xenografts (lacked Chr 1 
gain and 8p deletion) and the patient sample (Chr 8q gain) (Fig. 2e, 
m6). Patient sample 1-1 was highly aggressive in xenografts. All reci- 
pients of non-limiting cell doses and one from the limiting dose group 
contained the major diagnostic clone distinguished with a bi-allelic 
loss of CDKN2A (Fig. 2f, Chr 9). The other recipient from the limiting 
dose group did not have this CNA but retained the larger flanking 
CNA (Fig. 2fand Supplementary Fig. 15, m2). The topography of this 
lesion from each clone, together with the similarity of AgR regions in 
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SCID or NS122/NSG recipients. Human CD45 or CD44 (Supplementary Fig. 
18) was used to evaluate chimaerism. e, CNA analysis of individual group 1 and 
group 2 patient samples using Affymetrix 6.0 SNP arrays. Regions containing 
IKZF1 (top panel), CDKN2A/B (middle panel) and PAX5 (bottom panel) 
deletions are shown. Asterisks indicate focal deletions in IKZF1 involving exons 
3-6, consistent with formation of dominant-negative IKAROS isoform IK6. 
Data are log, ratio, median smoothing format (blue, deletion; white, normal; red, 
gain). See Supplementary Note for patient 1-8. f, Survival analysis of group 1 and 
group 2 patients (P = 0.083). g, Minimum cell dose required for leukaemia 
initiation in NS122 and NSG recipients from group 1 (1 = 5) versus group 2 
(n = 6) patient samples. 


all recipients (Fig. 2e, f, top panels), indicates common genetic ancestry. 
Each clone remained stable after secondary transplantation (Sup- 
plementary Fig. 16). Thus, our data provide formal evidence that 
ALL is composed of genetically distinct subclones that are present in 
varying proportions at diagnosis and have differing functional capacity 
in xenografts. Furthermore, our analysis reveals the sequence of genetic 
events that probably occurred in these patients and provides unique 
insight into the evolution of clonal diversity during ALL leukaemogenesis. 


Multi-clonal model of Ph* ALL pathogenesis 


To gain an insight into the evolutionary processes that underlie the 
generation of dominant and minor subclones, we combined functional 
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Figure 2 | Clonal dynamics of Ph* ALL upon transplant into xenografts. 
a, Schema of experimental design for tracking primary leukaemia clones in 
xenografts using limiting and bulk cell doses. b, Representative patient sample 
displaying a new CNA in each of three xenografts in the HBS1L locus that was 
below SNP array detection limit in the patient sample (additional examples on 
five patients in Supplementary Fig. 14). c, Pictorial representation of outgrowth 
of subclones (CNA discordant) or recapitulation of the predominant diagnostic 
clone (CNA concordant) after transplant into xenografts. d, Survival of patients 
that generated CNA concordant and discordant xenografts. e, f, CNA profiling 
of xenografts transplanted with limiting and bulk doses of patient cells. In 
patient 1-8 (e), multiple CNA present on Chr 1 (middle) and Chr 8 (bottom) 


analysis obtained from xenografts with clonal analysis carried out by 
CNA profiling on multiple xenografts derived from a group 1 and 
group 2 sample. For both patient samples, the dominant diagnostic 
was not detected in xenografts, rather they were repopulated with 
several related but distinct genetic subclones. Detailed tracking of 
CNA provided an unprecedented opportunity to gain an insight into 
the sequence of lesion acquisition in independent subclones. For 
example, in patient 1-6, deletion of the AgR region of Chr 11 in all 
recipients indicates that this change was an early event in disease patho- 
genesis that was shared by all subclones that outgrew after transplant, 
followed by a gain in a region of Chr 6q present in leukaemic cells from 
two xenografts or a deletion at the Chr 14 immunoglobulin heavy chain 
(IGH) locus in a different subclone found in another xenograft (Fig. 3a). 
Patient 2-9 displayed a CNA (gain) in Chr Xp (Fig. 3b, Chr X) at 
diagnosis. All xenografts displayed varying loss of this region, indicating 
that this CNA was acquired early in pathogenesis of this patient sample. 
Only two recipients contained a subsequent Chr 9q deletion (m37 and 
m39), with further clonal divergence leading to a deletion in a region of 
Chr 8q (seen in m37) and duplication of Chr 8q (seen in m39) (Fig. 3b, 
Chr 8 and 9). Interestingly, xenograft m41 also displayed duplication in 
Chr 8q, but lacked a deletion in chromosome 9. Therefore, it remains 
unclear which CNA (Chr 9 deletion or Chr 8q duplication) was 
acquired first in the patient. Data are summarized pictorially for both 
patient samples (Fig. 3, right panel). These data indicate that multiple 
tumour clones coexist in the diagnostic patient sample, and that these 
clones undergo divergent evolution from the diagnostic clone, support- 
ing the branching model of tumour progression””. 
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were detected in all recipients (m1-m4) at bulk cell doses (1 < 10° per mouse) 
that were not detected in the diagnostic sample. At limiting cell dose, one of 
three mice was engrafted (5,000 cells per mouse, m6) and had partial similarity 
to the other bulk recipients (Chr 7 AgR deletion, top panel; Chr 8q gain, bottom 
panel) and the diagnostic sample (Chr 7, AgR deletion). In patient 1-1 (f), two 
of nine mice were engrafted at limiting cell doses (50 cells per mouse; m1, m2) 
and a single recipient at non-limiting cell dose (~1 X 10° cells; m3). The 
diagnostic sample, m1 and m3 share common CNA in CDKN2A, not present in 
m2 (bottom panel). All samples share common CNA at the AgR locus (top 
panel). Data are log, ratio, median smoothing format (blue, deletion; white, 
normal; red, gain). 


Intratumoral heterogeneity may promote clonal evolution by 
increasing the number of selectable traits under any given stress. 
Therefore, genetic diversification is probably important for tumour 
survival. Recently, the degree of genetic diversification has been linked 
to clinical aggressiveness of breast tumours”’, and is associated with 
metastasis in pancreatic cancer**. To determine whether the various 
subclones we identified in patient sample 1-6 could evolve further, two 
primary xenografts were transplanted into secondary recipients. All 
CNA from primary xenografts were detected in secondary recipients, 
indicating overall stability of each subclone (Supplementary Fig. 17a). 
However, new CNA were detected in three of nine recipients, indi- 
cating that ongoing evolution and further progression of disease can 
occur, although it seems to be largely stochastic (Supplementary Fig. 
17b). Although limiting dilution studies were not performed to com- 
pletely rule out the possibility that additional minor subclones, 
undetected in the primary xenograft, contributed to these new CNA, 
these data indicate that genetic diversification can continue in the 
xenograft. 


Discussion 


Here we establish that individual Ph* ALL samples at diagnosis are 
composed of genetically diverse subclones that are related through a 
complex evolutionary process. These subclones vary in their xenograft 
growth properties and leukaemia-initiating-cell frequency. Furthermore, 
Ph* ALL patient samples can be segregated into two subgroups on the 
basis of functional xenograft growth properties and specific genetic 
mutations. The group that lost CDKN2A/B and had a tendency to poorer 
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Figure 3 | Detection of genetically diverse leukaemia-initiating cells in Ph* 
ALL. a, CNA profiling shows that three engrafted recipients from sample 1-6 
share a common CNA on Chr 11 (top) that is not detected in the diagnostic 
sample, but each is distinct for CNA on Chr 6 (middle) and Chr 14 (bottom). 
b, In sample 2-9, all five engrafted recipients shared a deletion (at varying 
degrees) in a region of Chr Xp (top), two recipients displayed a common CNA 
region of Chr 9q (middle; m37, m39), and three recipients had deletions (m37) 
and duplications (m39, 41) on regions of Chr 8p (bottom). A phylogenetic tree 
depicting the relationship between major and minor genetic subclones present 
in the diagnostic sample and a summary of the distinct xenograft subclones are 
shown to the right of each sample. The dashed line represents clonal evolution 
from disease origin (Ph, Philadelphia chromosome). 


survival correlated with aggressive dissemination in xenografts and 
higher leukaemia-initiating-cell frequency. These results are consistent 
with the aggressive, tyrosine-kinase-inhibitor-resistant Ph* ALL seen 
in murine models where Cdkn2a is lost. The discovery that, at diagnosis, 
genetically distinct samples and subclones already possess variably 
aggressive growth properties points to the need to develop effective 
therapies to eradicate all intratumoral genetic subclones to prevent 
further evolution and recurrence. The ability to segregate even minor 
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subclones in xenografts is a powerful tool for the preclinical develop- 
ment of new therapeutic strategies. 

The segregation of individual genetic subclones in xenografts provided 
an opportunity to reconstruct the functional genetic ancestry of sub- 
clones present in diagnostic samples. Our data illustrate that leukaemic 
progression can occur in either a linear or branching fashion, with mul- 
tiple genetic subclones evolving either in succession or in parallel, respec- 
tively. The xenograft growth characteristics of minor subclones were 
distinct: sometimes they out-competed the dominant clone, whereas 
in others (mostly group 1), they did not. The competitive advantage of 
the dominant clone also appeared to associate with poorer clinical 
outcome, although the size of the cohort was small. We speculate that 
some genetic events, such as loss of CDKN2A/B, contribute to clonal 
predominance at diagnosis and competitive xenograft growth. By con- 
trast, the reduced competitive advantage of minor subclones indicates 
that additional genetic events are required for increased aggressiveness. 
However, if minor subclones survive therapy, further evolution and 
expansion could occur, leading to future relapse, consistent with pre- 
vious genetic studies of paired diagnosis and relapse samples''. Because 
gene silencing and other epigenetic events contribute to tumour pro- 
gression, our results also highlight the need for genome-wide methyla- 
tion analysis of individual subclones. Evolution by branching increases 
subclonal complexity and underscores the importance of gaining a 
better molecular understanding of each subclone within a tumour. 

Outgrowth of subclones in serial xenografts can only be sustained 
by leukaemia-initiating cells, and our findings establish that genetic 
diversity occurs in this functionally important cell type. Moreover, the 
discovery that specific genetic events influence leukaemia-initiating- 
cell frequency and that genetically distinct leukaemia-initiating cells 
evolve through a complex evolutionary process indicates that a close 
connection must exist between genetic and functional heterogeneity. 
This has several implications that may bring together the clonal evolu- 
tion and cancer stem cell models. We remain cautious in extrapolating 
our findings in the absence of prospective isolation proving the exist- 
ence of leukaemia stem cells in Ph* ALL; however, strong evidence is 
accumulating for the existence and relevance of leukaemia stem cells 
in other forms of leukaemia” and it is likely that genetically diverse 
leukaemia stem cells will eventually be found. If the leukaemia- 
initiating-cell/leukaemia stem cell link is established, we can speculate 
that leukaemia stem cells are not static but are able to evolve genetically 
and represent units of selection in tumour evolution. As tumours 
evolve, the frequency of leukaemia stem cells increases, eventually 
progressing to a highly advanced state that might no longer adhere 
to a cancer stem cell model. The high leukaemia-initiating-cell fre- 
quency that we observed in some samples and from several murine 
models'®** supports this idea. Finally, as tumours are composed of 
genetically diverse subclones, prospective isolation of leukaemia stem 
cells/cancer stem cells needs to be interpreted with considerable care, as 
fractionation of cancer stem cell and non-cancer stem cell populations 
could segregate genetically distinct subclones with variable tumour- 
initiating-cell capacity as opposed to genetically identical cells with 
differing epigenetic/developmental programs. Because the hierarchy 
model posits that cancer stem cells give rise to non-cancer stem cells, 
future studies must account for subclonal diversity and establish the 
genetic identity of cancer stem cells and non-cancer stem cells. Overall, 
our findings indicate that there may be more commonalities between 
clonal evolution and cancer stem cell models of cancer than previously 
thought and that future studies may lead to a unification of these 
concepts. 


METHODS SUMMARY 

Diagnostic Ph* ALL patient samples were intrafemorally transplanted into female 
NOD.CB17-Prkdc*““/J (NOD/SCID) mice, NOD/SCID mice treated with mouse 
anti-CD122 (as previously described) and NOD.Cg-Prkde*“4 I larg’ WillS7y (NSG) 
mice. Xenograft recipients were monitored for disease sickness, and chimaerism 
was evaluated in various haematopoietic tissues using flow cytometry. DNA copy 
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number alteration (CNA) was carried out with Affymetrix 6.0 SNP arrays on the 
diagnostic patient sample and corresponding xenografts. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Patient samples. Patient samples (primarily peripheral blood) were obtained 
from newly diagnosed Philadelphia-positive acute lymphoblastic leukaemia 
patients according to pre-established guidelines approved by the Research 
Ethics Board of University Health Network. Three out of twenty patient samples 
were donated by the Quebec Leukaemia Cell Bank. All samples were frozen viably 
using standard protocols in FCS + 10% DMSO and stored long term at — 150°C. 
Cell viability, as assessed immediately after thawing, was greater then 90% for all 
cases. Detailed patient data are provided in Supplementary Tables 6 and 7. 
Xenotransplantation assay and analysis. NOD.CB17-Prkdc“"“/] (NOD/SCID) 
and NOD.Cg-Prkde*“Tlarg”"""/SzJ (NSG) mice were bred according to protocols 
established and approved by the Animal Care Committee at University Health 
Network. Ten-to-twelve-week-old old mice were sublethally irradiated at 
225 cGy 24h before transplant. NOD/SCID mice were also treated with mouse 
anti-CD122 monoclonal antibody (NS122) after sublethal conditioning as previ- 
ously described’. Only female mice were used in these studies**. NSG mice were 
periodically genotyped using standard PCR protocols (wild-type common forward 
primer, 5’-GITGGGTAGCCAGCTCTTCAG-3’; wild-type reverse, 5’-CCTGGAGC 
TGGACAACAAAT-3’; mutated reverse, 5'-GCCAGAGGCCACTTGTGTAG-3’). 
All primer sets were obtained from Jackson Laboratories website. 

Frozen leukaemia cells were thawed by drop-wise addition of IMDM + 0.2 mg 
1”! DNase (Roche Applied Science). After centrifugation, cells were resuspended 
in IMDM and counted using ViCell XR (Beckman coulter) or by trypan blue 
exclusion. For transplantation, cells were placed in microfuge tubes and spun 
down to remove excess media. Cells were resuspended in the correct volume 
(25 ul per mouse) for transplant using IMDM + 1% FCS. Intrafemoral transplant 
was performed as previously described”. Briefly, mice were anaesthetized using 
isoflurane. The right knee of mice was bent and drilled with a 27.5-g needle and 
followed by injection of cells with a 28.5-g insulin syringe (BD Biosciences). 

After transplant, animals were monitored for the appearance of disease symp- 
toms, such as weight loss, hunch-back, decreased activity, and killed soon after. 
Mice transplanted with samples that did not induce disease symptoms (in NS122 
or NSG recipients) were killed by 16-24 weeks. Upon death, injected right femur 
(IF), non-injected bones (left femur, right and left tibiae, bone marrow (BM)), 
spleen and peripheral blood were removed and analysed for the presence of 
human leukaemia blasts using flow cytometry. Aliquots of cells were stained in 
96-well round-bottomed plates (BD Falcon). Human cells were distinguished 
from mouse cells using human-specific CD45PC7 and CD44PE (Supplemen- 
tary Fig. 18). B-cell-specific markers (various combinations of IgM FITC, 
CD19 PC5 (Beckman Coulter), CD20 APC7, CD10 APC, CD34 APC7) were 
used to evaluate blast phenotype after transplant. Detection of normal haemato- 
poietic stem cell activity was monitored using CD33 PC5 (Beckman Coulter) or 
APC. T-cell engraftment in NSG mice was monitored using human CD3 FITC or 
APC. All fluorochromes were obtained from BD Biosciences unless otherwise 
indicated. Mice were considered to be engrafted when multiple human leukaemia 
markers (for example, CD44* CD45*CD34* CD19") were detected by at least the 
0.5% threshold. Virtually all cases of engraftment were well above this threshold. 


All flow cytometry analysis was performed on the LSRII (BD Biosciences). The 
remaining marrow (IF + BM) was frozen viably in FCS + 10% DMSO. 

DNA SNP microarray analysis. DNA isolated from patients at diagnosis and 
xenograft samples was analysed using Affymetrix 6.0 SNP arrays. SNP array data 
were analysed using dChip no. 4 software using a reference algorithm and circular 
binary segmentation as previously described'''*"°*°**. To distinguish inherited 
from somatic DNA copy number alterations for primary patient samples lacking 
matched normal DNA, putative variants identified by unpaired segmentation were 
filtered using public copy number polymorphism databases’, and an in-house 
database of SNP array data from several hundred samples. SNP array data are 
available at dbGaP (phs000329.v1.p1) and is also hosted through St Jude 
Children’s Research hospital (http://hospital.stjude.org/forms/genome-down- 
load/request/). Sample information is shown in Supplementary Table 7. 
Quantitative PCR of the CDKN2A/B locus. Primers for genomic quantitative 
PCR were designed using Primer Express 3.0 (Applied Biosystems), and are listed in 
Supplementary Table 3. Taqman RNase P primers (Applied Biosystems) were used 
for control amplification. Fifty nanograms of leukaemic blast DNA or control human 
DNA was amplified using a 7500 Real-Time PCR system and 7500 System Software 
(Applied Biosystems), using the 7500 universal cycling conditions: 50 °C for 2 min, 
followed by 95 °C for 10 min, then 40 cycles of 95 °C for 1 min and 60 °C for 1 min. 
Standard curves for each CDKN2A/B exon and RNase P were generated using 
normal human DNA. Assays were performed in duplicate. CDKN2A/B exon-specific 
copy number values were normalized by dividing the value obtained for each test 
reaction by the paired value obtained for RNase P for each sample. Cutoffs of 0.65 and 
0.3 were used to identify hemizygous and homozygous PAXS deletions, respectively. 
Statistical analysis. All data were analysed using GraphPad Prism version 5.00 for 
Mac OS X (http://www.graphpad.com). The Mann-Whitney U-test was used to 
assess statistically significant differences in chimaerism in xenografts. Clinical out- 
come data was analysed using the Gehan-Wilcoxon method based on random 
10,000 permutations. Because the normal distribution of data can only be assumed 
in larger cohorts, this modified test more accurately computes P-values for small 
patient cohorts. 
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Nascent transcript sequencing visualizes 
transcription at nucleotide resolution 


L. Stirling Churchman! & Jonathan S. Weissman! 


Recent studies of transcription have revealed a level of complexity not previously appreciated even a few years ago, both 
in the intricate use of post-initiation control and the mass production of rapidly degraded transcripts. Dissection of these 
pathways requires strategies for precisely following transcripts as they are being produced. Here we present an approach 
(native elongating transcript sequencing, NET-seq), based on deep sequencing of 3’ ends of nascent transcripts 
associated with RNA polymerase, to monitor transcription at nucleotide resolution. Application of NET-seq in 
Saccharomyces cerevisiae reveals that although promoters are generally capable of divergent transcription, the 
Rpd3S deacetylation complex enforces strong directionality to most promoters by suppressing antisense transcript 
initiation. Our studies also reveal pervasive polymerase pausing and backtracking throughout the body of transcripts. 
Average pause density shows prominent peaks at each of the first four nucleosomes, with the peak location occurring in 
good agreement with in vitro biophysical measurements. Thus, nucleosome-induced pausing represents a major barrier 


to transcriptional elongation in vivo. 


Accumulating evidence now reveals that transcription elongation is 
not a straightforward read-out of the downstream DNA sequence. 
Co-transcriptional processing events dictate the covalent nature and 
fate of RNA transcripts’. Indeed many transcripts are targeted co- 
transcriptionally for rapid degradation and hence are effectively invisible 
to approaches that monitor mature messenger RNAs*™~*. In addition to 
these processing events, the strong propensity of RNA polymerase 
(RNAP) to pause creates barriers to elongation and provides an oppor- 
tunity for regulation and coordination of co-transcriptional events*®. In 
vitro, RNAP pausing is found to be ubiquitous’. Biophysical approaches 
have provided a structural and energetic understanding of RNAP paus- 
ing which results from both intrinsic properties of the polymerase itself 
as well as interactions with its DNA template, including the presence of 
bound proteins (for example, histones)*”. In the cell, elongation factors 
probably alter the energetic landscape of transcription, but the extent 
and mechanism of RNAP pausing in eukaryotic cells remain largely 
unknown. Bridging the divide between in vivo and in vitro transcrip- 
tional views requires approaches that visualize transcription with com- 
parable precision afforded by in vitro transcriptional assays. More 
generally, the ability to monitor quantitatively nascent transcripts would 
provide broad insights into the roles and regulation of transcription 
initiation, elongation and termination in gene expression. 

Historically, two strategies have been used to provide snapshots of 
transcriptional activity in vivo. In the first approach, RNAP is crosslinked 
to DNA and RNAP-bound DNA elements are identified by microarrays 
or deep sequencing’*"*. Although providing a global view of RNAP 
binding sites, these measurements are of limited spatial and temporal 
resolution and do not reveal the identity of the transcribed strand or even 
whether RNAP molecules are engaged in transcription. In the second 
approach, transcription is halted in vivo and then reinitiated in isolated 
nuclei under conditions that allow labelling of nascent chains, thereby 
enabling them to be distinguished from bulk RNA’*’*. Such ‘nuclear 
run-on’ strategies reveal actively transcribed DNA regions but require 
extensive manipulations that limit resolution and depend on the efficient 
re-initiation of transcription under non-physiological conditions. 


To monitor the transcriptional states of unperturbed cells, we sought 
to determine the precise in vivo position of all active RNAP complexes. 
Here we present an approach (native elongating transcript sequencing, 
NET-seq) that accomplishes this goal by exploiting the extraordinary 
stability of the DNA-RNA-RNAP ternary complex” to capture nascent 
transcripts directly from live cells without crosslinking. The identity and 
abundance of the 3’ end of purified transcripts are revealed by deep 
sequencing”, thus providing a quantitative measure of RNAP density 
with single nucleotide precision. Using NET-seq, we expose rapidly 
degraded transcription products, locate the position of RNAP pauses, 
and identify factors and chromatin structure that regulate these tran- 
scription events. 


Quantifying transcription at nucleotide resolution 
We focused on the transcription by RNAPII of protein-coding genes in 
the budding yeast Saccharomyces cerevisiae, although the NET-seq 
approach should be readily adaptable to other systems. To facilitate 
purification, we worked with a strain that endogenously expressed a 
functional variant of RNAPII with a 3X-Flag epitope attached to its 
third subunit (Rpb3). Log-phase cultures were collected by filtration 
and flash frozen in liquid nitrogen (Fig. la). After cryogenic lysis, 
RNAPII was efficiently immunoprecipitated (Supplementary Fig. 1a). 
We prepared the co-purified RNA for deep sequencing using a protocol 
that allows efficient RNA capture while minimizing bias’, and 
sequenced 40 bases from the 3’ end. The alignment of these sequences 
to the yeast genome identified the final nucleotide that was incorpo- 
rated by RNAPII, and the number of sequencing reads at each position 
along the genome indicated the density of transcriptionally active RNA 
polymerases at that site (Fig. 1b, alignment statistics displayed in 
Supplementary Table 1). A metagene analysis of RNAPII distribution 
across transcription units shows higher RNAPII density for the first 
700 base pairs (bp) from the 5’ end (Fig. 1c), consistent with lower 
resolution observations seen using a global run-on approach’*. 
Several observations indicate that we are detecting nascent transcrip- 
tion. First, we robustly capture transcripts from introns and regions 


1Department of Cellular and Molecular Pharmacology, Howard Hughes Medical Institute, University of California, San Francisco and California Institute for Quantitative Biosciences, San Francisco, 


California 94158, USA. 


368 | NATURE | VOL 469 | 20 JANUARY 2011 


©2011 Macmillan Publishers Limited. All rights reserved 


mG 


Sy 


Freeze, vee | 


IP 
mG 


Library generation | 


RNAPII pNA 
‘OH 


mG 
RNAQ ag 


Sequencing primer 
q 9 Pp 


5’ Linker — Insert 3’ Linker 


Figure 1 | NET-seq visualizes active transcription via capture of 3’ RNA 
termini. a, Schematic diagram of NET-seq protocol. A yeast culture is flash 
frozen and cryogenically lysed. Nascent RNA is co-purified via an 
immunoprecipitation (IP) of the RNAPII elongation complex. Conversion of 
RNA into DNA results in a DNA library with the RNA as an insert between 
DNA sequencing linkers. The sequencing primer is positioned such that the 3’ 
end of the insert is sequenced. m’G refers to the 7-methylguanosine cap 
structure at the 5’ end of nascent transcripts. b, The 3’ end of each sequence is 


after polyadenylation sites; areas that are present in nascent transcripts 
but absent from mature messenger RNAs (Fig. 1b). Second, we verified 
that transcripts do not associate with RNAPII after cell lysis (Sup- 
plementary Table 2). Third, we saw negligible degradation of RNA 
under the immunoprecipitation conditions. Nevertheless, our library 
generation protocols prevent detection of co-purified degradation pro- 
ducts by requiring that input RNAs have 3’ hydroxyl termini, as hydro- 
lysis and degradation products primarily have terminal phosphates”. 
Finally, we saw that transcription did not proceed during processing of 
lysates as addition of the transcription inhibitor %-amanitin to the lysis 
buffer did not change the RNAPII density (Fig. Ic). 

In addition to nascent transcripts, the RNAPII immunoprecipitation 
captures splicing intermediates (that is, the 5’ exon and the excised 
lariat). Their 3’ hydroxyl termini allow them to appear in our data at 
the 3’ ends of exons and introns (Supplementary Fig. 5). These observa- 
tions indicate the widespread existence of co-transcriptional splicing in 
yeast and establish NET-seq as a powerful tool for studying such events. 


Direct observation of transcription of unstable RNA 

NET-seq monitors transcripts regardless of their stability, making it 
ideally suited to the analysis of unstable transcripts. Recent studies have 
revealed a class of cryptic unstable transcripts (CUTs) that are short 
(less than ~700 nucleotides), upstream and antisense to an annotated 
gene and rapidly degraded by the exosome**’”’. Divergent transcrip- 
tion, yielding the production of antisense CUTs and mature messenger 
RNAs from the sense direction, is seen at many promoters in both yeast 
and metazoans. The observation of widespread divergent transcrip- 
tion was surprising and it remains unclear how antisense transcripts 
initiate and what biological function they may have. It is likely that the 
nucleosome-free region associated with promoters facilitates antisense 
transcription. Additionally, it has been suggested that antisense and 
sense transcription levels are co-dependent’*”’, as transcription in the 
sense direction could promote upstream antisense transcription (and 
vice versa) by creating negatively supercoiled DNA and recruiting 
factors that set permissive histone marks™. Critical evaluation of these 
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mapped to the yeast genome and the number of reads at each nucleotide is 
plotted at the RPL30 locus for nascent RNA and lightly fragmented mature 
RNA. Note that for the nascent transcripts, the introns (grey box) and regions 
after the polyadenylation site (black arrow) are readily detected. c, Metagene 
analysis for well-expressed genes (n = 471, >1.5 reads per bp in both 
conditions) of the mean read density (arbitrary units, a.u.) in the presence and 
absence of transcription inhibitor, «-amanitin. TSS, transcription start site. 


hypotheses has been limited by the difficulty in quantitatively moni- 
toring the levels of unstable antisense transcripts. 

As NET-seq directly monitors the production of transcripts, we 
were able to quantify the relative amounts of nascent sense and anti- 
sense transcripts (Fig. 2a, b). We focused our analysis on promoters 
between genes encoded on the same strand (tandem genes), because in 
those instances, antisense transcripts can be clearly differentiated from 
the stable upstream transcript. To quantify divergent transcription, we 
integrated the transcript levels for the first 500 bp of transcribed DNA 
in each direction. Although we clearly observed divergent promoters, 
the large majority of promoters had much less antisense transcription 
than sense transcription; for more than half of the promoters, sense 
transcription was at least eight times higher than antisense transcrip- 
tion, and for 80% of the promoters the sense-to-antisense transcription 
ratio exceeded threefold (Fig. 2b). Notably, a comparison between the 
levels of sense and antisense transcription showed only modest cor- 
relation (Spearman correlation coefficient, r, = 0.34) (Fig. 2c). 

The above analysis establishes that antisense transcription is not an 
obligatory consequence of having an active promoter. What then 
dictates whether a promoter is directional? Transcription initiation 
is known to occur in nucleosome-free regions; however, we failed to 
see a correlation when we compared antisense transcription levels 
with published data” reporting on promoter nucleosome-free-region 
size and promoter average nucleosome occupancy (Supplementary 
Fig. 2a, b). We also investigated whether histone modifications assoc- 
iated with active promoters correlated with antisense transcription, as 
it was observed that H3 acetylation peaks in regions of antisense 
transcription in human fibroblasts’*. Notably, we found a strong pos- 
itive correlation (r, = 0.65) between antisense transcription levels and 
earlier measurements of the levels of H4 (and to a lesser extent H3) 
acetylation enrichment™ (Fig. 2d and Supplementary Fig. 2c, d). 


Rpd3S promotes promoter directionality 
The strong correlation between antisense transcription and H4 acet- 
ylation indicates that H4 acetylation may have a causative role in 
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Figure 2 | Observation of divergent transcripts 
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facilitating antisense transcription. To test this, we examined the 
effect on antisense transcription of loss of RCO1, a required and 
dedicated subunit of the Rpd3 small (Rpd3S) H4 deacetylation com- 
plex**’*. We focused on Rpd38, as earlier studies had shown that it 
contributes to deacetylation of H4 in the 3’ region of transcripts and 
the large majority of antisense transcripts overlap the 3’ ends of 
upstream genes. Previous global studies of Rpd3S monitored accu- 
mulation of mature stable RNAs and so would not detect the effects of 
Rcol on transient RNA species**”*. Our analysis revealed a pervasive 
increase (average fourfold) in unstable antisense transcription 
(Fig. 3a, b). This effect was the dominant transcriptional phenotype 
that we observed and was specific to antisense transcription: we found 
no systematic increase in RNAPII density at the beginning of sense 
transcripts (Supplementary Fig. 3). Importantly, antisense transcripts 
seen in the rco1/ strain have the same transcription start sites and the 
same lengths as the wild-type transcripts, indicating that Rcol is 
acting at the initiation stage of antisense transcription and does not 
affect termination (Fig. 3c). Additionally, we observed that deletion of 
EAF3, another subunit of Rpd3S, mimicked the increases seen in the 
rcolA data (r, = 0.88, Supplementary Fig. 4). Thus, the primary func- 
tion of the Rpd3S histone deacetylase complex seems to be to enforce 
promoter directionality. 

This raises the question of how Rpd3S is recruited to positions 
designated for suppression of antisense transcription. The Rcol and 
Eaf3 components of the Rpd3S complex bind H3 lysine 36 methyla- 
tion marks made by Set2 and that binding activates the deacetylase 
activity of Rpd3S (refs 25-27). However, a distinct RNAPII-associated 
methyltransferase, Set1, has also been implicated in Rpd3S-dependent 
repression”’. Moreover, even in the absence of methylation, RNAPII is 
capable of recruiting Rpd3S to gene bodies during transcription”. 

To investigate how Rpd3S is localized to suppress antisense tran- 
scription, we analysed nascent transcripts in cells lacking Set1 or Set2. 
SETI deletion caused a weak increase in antisense transcription in a 
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manner that correlated only modestly with the rcolA and eaf3A data 
(r, = 0.36 and r, = 0.38 respectively; Supplementary Fig. 5). In con- 
trast, deletion of SET2 led to a pronounced increased in antisense 
transcription that was highly correlated with the rcolA and eaf3A 
data (r, = 0.88 and r,=0.89 respectively; Supplementary Fig. 5). 
These data together with earlier work on the Set2/Rpd3S pathway 
indicate that the major mechanism for Rpd3S action on antisense 
transcription involves Set2 recruitment to elongating RNAPII via 
Ser 2 phosphorylation on its carboxy-terminal domain”. This in turn, 
through the Set2 methylation activity, allows recruitment of Rpd3S to 
the 3’ ends of genes, suppressing antisense transcription from down- 
stream nucleosome-free regions. Future challenges will be to explain 
how histone acetylation in the body of antisense transcripts can affect 
transcription initiation, and to determine other mechanisms that localize 
Rpd3S, particularly for the handful of antisense transcripts that do not 
overlap the 3’ ends of genes. 


Pausing occurs throughout transcription elongation 


The ability of NET-seq to map the density of nascent transcripts 
enables in-depth investigation of the extent and sources of RNAP 
pausing in vivo. Our data revealed strong and highly reproducible 
spikes in the density of 3’ ends of nascent transcripts along a given 
gene indicative of RNAPII pause sites (for example, GPM1; Fig. 4a). 
We developed an algorithm to identify RNAPII pause positions that 
finds points where the read density is at least three standard deviations 
above the mean in a local 200-bp window. We found that pauses occur 
frequently throughout the body of RNA messages and are evenly 
distributed after the first ~700 bp (Fig. 4b and Supplementary Fig. 
6). The high density of pauses was not an artefact of library generation 
and sequencing biases, as we detected tenfold fewer spikes in data 
from messenger RNA lightly fragmented by alkaline hydrolysis 
(Supplementary Fig. 7). Notably, 70% of the more than 2 X 10° pause 
sites that we identified had an A at the 3’ end of the transcript. 
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Additionally, there was a preference for the pause to be followed imme- 
diately by a T and then G (Fig. 4c). None of these biases was seen in the 
control sample of fragmented mRNA (Supplementary Fig. 7a). 

Largely from in vitro studies, one mechanism of RNAP pausing has 
been shown to involve backtracking: after encountering a blockage, 
RNAP reverses direction and moves upstream*". In the backtracked 
state, the 3’ end of the RNA transcript is no longer aligned with the 
active site and RNAP must either return to the initial pause site or 
cleave the transcript. The latter option is aided by the presence of the 
elongation factor TFIIS (Dstl in yeast) that enhances RNAP’s 
intrinsic RNA cleavage activity (Fig. 5a)’**. Although the role of 
TFUS is well established in vitro, its mechanism in vivo has been less 
explored****, 

To investigate the role that backtracking has in pausing in vivo, we 
deleted DST1 and repeated the NET-seq assay. Notably, we saw a 
large-scale downstream shift in the position of the pauses, an average 
of 5-18 bp (Fig. 5b, c). This shift was observed for ~75% of the pauses 
(Supplementary Fig. 8) and was accompanied by a global change in 
the sequences surrounding pause sites; the preference for A at the 
pause was lost and instead there was a strong preference for T imme- 
diately downstream of the pause (Fig. 5d). These observations con- 
firm that the observed spikes in NET-seq data result from RNAPII 
pausing, and indicate that pausing followed by backtracking—which 
previously had been observed at promoter-proximal pauses**—is pre- 
valent throughout the body of transcripts. Additionally, our studies 
indicate that Dstl-stimulated RNA cleavage has a strong sequence 
bias and that a slow step follows cleavage before transcription 
resumes. 
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Figure 3 | Rcol suppresses antisense transcription at divergent promoters. 
a, Examples of cryptic unstable transcripts (CUTs, orange data) upstream and 
antisense of DBF2, DRNI and VAS1 promoters. The fold increase of CUT 
transcription in the rcolA strain is marked. b, The transcription ratio 
(antisense/sense) in the rco1J strain is plotted against the transcription ratio in 
the wild-type strain for each gene. c, A metagene analysis of well-expressed 
antisense transcription (nm = 171, >1 read per bp). 
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RNAPII pause density peaks before the nucleosome dyad 


The pauses observed in the dst1/ strain reveal positions where 
RNAPII began to backtrack and, therefore, represent the primary 
point of transcriptional blockage. By analysing these pause positions, 
we can evaluate what induced RNAPII to backtrack. In vitro, nucleo- 
somes induce RNAPII backtracking and TFIIS aids the progression of 
RNAPII through them’®””. In vivo, it is unknown whether nucleo- 
somes interfere with transcription, as chromatin remodelling factors 
could greatly diminish the nucleosome barrier or remove nucleo- 
somes before RNAPII arrival*”°*. Global high-resolution measure- 
ments of steady-state nucleosome occupancy revealed that the first 
few nucleosomes after the transcription start site are phased and well 
positioned”. Thus, by correlating the relative density of RNAPII 
pauses with nucleosome positions, we can evaluate whether nucleo- 
somes promote RNAPII pausing in vivo. 

We compared the pause positions in the dst1/ strain to the centre 
positions of nucleosomes using previously published data*’. Notably, 
we saw marked peaks of mean pause density at each of the first four 
nucleosomes (Fig. 6). The precise position of the point of maximal 
RNAPII pausing at the +1 nucleosome is obscured because it is 
located just after the transcription start site where many nascent 
transcripts are too short for unique alignment to the genome. For 
the +2, +3 and +4 nucleosomes, however, the pause density peaks 
just before the nucleosome dyad axis (Fig. 6). As would be expected 
from RNAPII backtracking, the excess pause density before the 
nucleosome dyad in the wild-type strain is spread out over the 
upstream region (Supplementary Fig. 9). 

Our finding that the peak in pause density occurs just before the 
nucleosome dyad is particularly remarkable as it is in excellent agree- 
ment with earlier biophysical measurements. Specifically, optical 
trapping studies that physically unwrapped the DNA ofa nucleosome 
off the histone core observed that the dyad is the point where the 
strongest DNA-histone contacts are found”. Moreover, high-resolution 
optical trapping experiments that followed RNAPII transcribing 
through a nucleosome found that the RNAPII pause density peaked 
before the nucleosome dyad’°. Taken together, the above observations 
provide strong evidence that nucleosomes do indeed present a barrier to 
elongating polymerases in vivo and that this barrier leads to polymerase 
pausing and backtracking. 
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Figure 4 | Frequent RNAPII pausing throughout gene bodies. a, NET-seq 
data at the GPM1 gene for biological replicates. b, A histogram of the mean 
distance between pauses for each well-expressed gene (n = 1,006, >2 reads per 
bp). ¢, The consensus sequence of the DNA coding strand surrounding pause 
sites found from all genes. 
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Figure 5 | Dst1 relieves RNAPII pausing after backtracking. a, A schematic 


describing an existing model for how RNAPII pauses at an obstacle (red 
square), backtracks and is induced to cleave its transcript through binding to 
Dst1 (refs 32, 33). b, A comparison of NET-seq data for wild-type and dst1A 
strains at the GPM1 gene. c, Mean cross-correlation between the dst1A and 
wild-type data of well transcribed genes (n = 770, >2 reads per bp) (green line) 
was calculated by determining the Pearson’s correlation coefficient at each gene 
between fixed dst1A data and shifted wild-type data followed by averaging over 
all genes. This analysis is compared to the mean autocorrelation of the wild- 
type data for well transcribed genes (blue line). d, The consensus sequence for 
all pauses observed in the dst1A strain. 


Perspective 

One of the major surprises in the transcription field in recent years has 
been the widespread observation of divergent transcription, revealing 
that the majority of promoters engage in canonical transcription in the 
sense direction along with the production of unstable transcripts in the 
antisense direction” *'°?'. NET-seq provides an ideal tool to look at this 
phenomenon and uncovers several fundamental properties of diver- 
gent transcription. First, most promoters show a strong directionality 
favouring the sense transcript. Second, suppression of antisense tran- 
scripts is enforced by two distinct mechanisms: Rpd3S-mediated 
deacetylation that prevents antisense initiation, and an independent 
mechanism, previously characterized to involve the Nrd1-Nab3-Sen1 
complex’, that terminates antisense transcripts and shuttles them to 
the exosome for degradation. Interestingly, sense transcription may 
also use this termination mechanism, as our data showed an enrich- 
ment for transcripts at the 5’ end of genes that mirrors what we 
observed for antisense transcripts and complements observations that 
Nrd1 localizes to the 5’ end of genes*’. Third, our observations indicate 
independence between the initiation of the sense and antisense tran- 
scripts. Specifically, we found only modest correlation between sense 
and antisense transcription levels. Moreover, even among the set of 
antisense transcripts that increased when RCO1 is deleted, no increase 
in sense transcription levels was seen. These findings argue against 
models in which antisense transcription serves to promote sense 
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Figure 6 | Nucleosomes are a major barrier to transcription. Plot of mean 
pause densities in dst1A data relative to the first four nucleosomes after the 
transcription start site using available nucleosome positioning data”*. Error bars 
represent one standard deviation. 


transcription (for example, by unwinding DNA supercoils or by 
removing nucleosomes). 

The potential for RNAP to pause has been apparent for decades, 
motivating interest in the mechanisms and regulatory roles of pausing 
in the process of transcription””"*'*"°, NET-seq provides the first in- 
depth view of pausing in a eukaryotic cell, revealing that transcription 
is punctuated by pauses throughout the body of all RNA messages. 
Taking into account both the abundance and magnitude of the pauses, 
we conclude that RNAPII spends comparable time in a paused state 
and moving forwards (Supplementary Fig. 10). We establish that 
nucleosomes induce pausing in vivo, and may be the major source 
of pausing considering that the increase in pause density at nucleo- 
somes is comparable to the increase in nucleosome occupancy”’. Our 
observation that pausing peaks at the nucleosome dyad reveal a strik- 
ing similarity between our measurements and optical trap measure- 
ments, indicating that the physical forces observed in purified in vitro 
systems are at play in the cell. NET-seq’s ability to follow the physical 
basis of transcription in vivo, allowing direct comparison with high- 
resolution in vitro measurements, may prove to be the most trans- 
formative aspect of this approach. 


METHODS SUMMARY 

Nascent RNA purification. All experiments were conducted using derivatives of 
yeast strain BY4741. Epitope-tagged Rpb3 (C-terminal 3X-Flag) was expressed 
from its endogenous locus. Deletion strains were made by standard PCR-based 
methods. Litres of log phase culture in YEPD were harvested by filtration and flash 
frozen by plunging into liquid nitrogen. Frozen cells were lysed cryogenically via 
six cycles of pulverization using a mixer mill. 

Clarified and DNase-I-digested lysate was added to washed anti-Flag M2 affinity 

gel (Sigma Aldrich), incubated at 4°C and nutated for 2.5 h. After washing, bound 
proteins were eluted twice with 2 mg ml’ 3X-Flag peptide (Sigma Aldrich). RNA 
from the eluates was purified using the miRNeasy kit (Qiagen). 
RNA linker ligation, cDNA synthesis and PCR. An RNA linker that was 5’ 
adenylated and 3’-end blocked with a dideoxy-C base (5’-CTGTAGGCACC 
ATCAAT, Integrated DNA Technologies) was ligated onto the 3’ end of the 
immunoprecipitated RNA based on a previously described strategy**. Ligation 
conditions (see Methods) were systematically optimized to maximize ligation 
efficiency to ~90% to ensure that the majority of the input RNA was ligated 
and thus avoiding any bottleneck biases. 
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cDNA synthesis and sequencing was performed as described with a few modi- 
fications’’. The sequencing primer binding site was positioned so that sequencing 
would start at the 3’ end. 
Comparing pause densities to nucleosome positions. Nucleosome positions” 
were assigned as +1, +2, +3 etc according to their position relative to transcrip- 
tion start sites. The mean pause density (MPD) relative to a particular nucleo- 
some was determined by the number of pauses observed at that position (N,) 
divided by the total number of opportunities it could be observed there (N,): 


all genes 


> gi(y) 


MPD a : 
x) No «(Benes with TSS<y 
1 


yank+x 


where k is the nucleosome number, g(y) is the binary function indicating whether a 
pause occurs at y, and né are the centre nucleosome positions. The error of the pause 
density was calculated via the standard deviation of the binomial distribution: 


Np(1 —r/y,) 
No 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Strain construction. All experiments were conducted using derivatives of yeast 
strain BY 4741. Epitope-tagged Rpb3 (C-terminal 3X -Flag) was expressed from its 
endogenous locus. Deletion strains were made by standard PCR-based methods. 
Extract and total RNA preparation. Yeast strains were grown in YEPD at 30°C 
with shaking from an initial optical density (OD) of 0.1 to mid-log phase with an 
OD of 0.6-0.8. Two litres of yeast culture were harvested in turn by filtration onto 
0.45-j1m-pore-size nitrocellulose filters (Whatman). The culture was scrapped off 
the filter with a spatula pre-chilled by liquid nitrogen and flash frozen by plunging 
into liquid nitrogen. Frozen cells were pulverized for six cycles, each of 3 min at 
15 Hz, on a Retsch MM301 mixer mill. Sample chambers were pre-chilled in 
liquid nitrogen and re-chilled between each pulverization cycle. 

One gram of ground cells (~ 1] at 0.7 OD) was added to 5 ml of ice-cold lysis 
buffer (20 mM HEPES, pH 7.4, 110 mM KOAc, 0.5% Triton X-100, 0.1% Tween 
20, 10mM MnCh, 50U ml ' SUPERaseeIn (Ambion)) supplemented with 
protease inhibitor cocktail (1x Complete, EDTA-free, Roche). The experiment 
using o.-amanitin included 10 pg ml * «-amanitin (Sigma Aldrich) in the lysis 
buffer. After re-suspending the lysate by pipetting, 660 units of DNase I 
(Promega, RQ1 RNase-Free DNase) was added and incubated for 20 min on 
ice. The lysate was then clarified by centrifugation at 4°C at 20,000g for 
10 min. The supernatant is reserved for immunoprecipitation. 

Two-hundred microlitres of clarified lysate is reserved for total RNA purifica- 
tion which was done by the hot acid phenol method. Typical yields were 20 pg. 
Native affinity purifications of RNAPII. 0.5 ml of Anti-Flag M2 Affinity Gel 
(Sigma Aldrich) was washed twice with lysis buffer. The clarified lysate was added 
to the washed gel, incubated at 4 °C and nutated for 2.5 h. The immunoprecipita- 
tion was washed 4 X 10 ml with wash buffer (20 mM HEPES, pH 7.4, 110 mM 
KOAc, 0.5% Triton X-100, 0.1% Tween 20, 50 U ml _' SUPERasesIn (Ambion), 1 
mM EDTA). Bound proteins were eluted twice with 150 pl elution buffer (20 mM 
HEPES, pH7.4, 110mM KOAc, 0.5% Triton X-100, 0.1% Tween 20) with 2 
mgml | 3X-Flag peptide (Sigma Aldrich). RNA from the combined eluates 
was purified using the miRNeasy kit (Qiagen, 217004). A typical yield from 
approximately one litre of log-phase yeast culture was 3 1g. 
mRNA purification and fragmentation. Polyadenylated mRNA was purified 
from 50 1g total RNA using magnetic oligo-dT DynaBeads (Invitrogen). Purified 
RNA was eluted in 20 pil 10 mM Tris, pH 7.0. The purified mRNA was mixed with 
an equal volume of 2X alkaline fragmentation solution (2mM EDTA, 10mM 
Na CO3, 90 mM NaHCOs, pH ~ 9.3) and incubated for 5 min at 95 °C. These 
conditions yielded lightly fragmented RNA of size distribution similar to that of 
the nascent RNA. The fragmentation reaction was stopped by the addition 
0.56 ml of ice-cold precipitation solution (final 300 mM NaOAc pH5.5, plus 
GlycoBlue (Ambion) as a co-precipitant) and RNA was purified by a standard 
isopropanol precipitation, as follows: after adding 650 il of isopropanol, samples 
were placed at —30°C for at least 30 min. Precipitated RNA was pelleted by 
centrifugation at 4°C at 20,000g for 30 min. The pellet was air dried after a quick 
wash with 80% ethanol and then re-suspended in 10 mM Tris pH 7.0. 

A total of 6.4 1g of fragmented mRNA was dephosphorylated in a 50 pil reaction 
with 1X T4 polynucleotide kinase buffer without ATP, 0.5U SUPERaseeIn 
(Ambion) and 22.5 units T4 polynucleotide kinase (NEB). The dephosphorylation 
reaction was incubated at 37 °C for 1h followed by 10 min at 75 °C for enzyme heat 
inactivation. RNA was precipitated with GlycoBlue by standard methods (see above). 
RNA linker ligation, fragmentation and size selection. An RNA linker that was 
5’ adenylated and 3’-end blocked with a dideoxy-C base (5’-CTGTAGGCACCA 
TCAAT, Integrated DNA Technologies) was ligated onto the 3’ end of the 
immunoprecipitated RNA, the fragmented mRNA and a synthetic 28-base 
RNA oligonucleotide (oNTI199, 5’-AUGUACACGGAGUCGACCCGCAACG 
CGA) similarly to what has been described’. Specifically, 3 ug of each RNA 
sample was broken into three reactions and diluted to 10 ul with 10 mM Tris, 
pH 7.0. After a brief denaturation the reactions were brought to 20 pl with a buffer 
that gave final concentrations of 12% PEG8000, 50 ng wl? linker, 1< T4 Rnl2, 
truncated reaction buffer and 2 units pl! of T4 Rnl2, truncated (NEB). The 
reaction was incubated at 37 °C for 3h. Ligation conditions were systematically 
optimized to maximize ligation efficiency to ~90% to ensure that the majority of 
the input RNA was ligated. 

Fragmentation of the ligated samples allowed for the final DNA library to 
contain inserts of a narrow range to reduce any length biases of downstream 
enzymatic reactions. EDTA was added to all reactions for a final concentration of 
17mM. 20,1 of 2 alkaline fragmentation solution (2mM EDTA, 10mM 
Na2CO3, 90 mM NaHCOs;, pH ~ 9.3) was added to each reaction and incubated 
at 95 °C for 30 min. The reactions were stopped by the addition of 0.56 ml of ice- 
cold precipitation solution (final 300mM NaOAc pH5.5, plus GlycoBlue 
(Ambion) as a co-precipitant), followed by a standard isopropanol precipitation 
(see above). 


The ligated and fragmented samples were size-selected by gel electrophoresis. The 
purified reactions along with the oNTI199 RNA oligonucleotide was mixed with 
2X Novex TBE-Urea sample prep buffer (Invitrogen) and briefly denatured, then 
loaded on a Novex denaturing 15% polyacrylamide TBE-urea gel (Invitrogen) and 
run according to the manufacturer’s instructions. The gel was stained with SYBR Gold 
(Invitrogen) and the 35-85-nucleotide region was excised. The gel was physically 
disrupted and either allowed to soak overnight in gel elution buffer (300 mM 
NaOAc pH5.5, 1mM EDTA, 0.1 Up! SUPERase*In) or incubated in 200 ul of 
water treated with diethylpyrocarbonate (DEPC) for 10 min at 70 °C. The gel debris 
was removed from the water or buffer using a Spin-X column (Corning) and RNA was 
precipitated with GlycoBlue as a co-precipitant using standard methods. 
cDNA synthesis. cDNA synthesis was performed as described with a few modi- 
fications’. The primer used for reverse transcription was oLSC003 (5’-pT'CG 
TATGCCGTCTTCTGCTTGe-AATGATACGGCGACCACCGATCCGACGAT 
CATTGATGGTGCCTACAG) where the initial ‘p’ indicates 5’ phosphorylation 
and ‘*’ indicates the following spacer added for increased flexibility, 18 carbon 
spacer molecule-CACTCA- 18 carbon spacer molecule. Efficient circularization of 
the RT product was performed as described’ with CircLigase (Epicentre) accord- 
ing to the manufacturer’s directions. Any ligation bias at this step is averaged out as 
the random fragmentation leaves a range of 5’ ends for each 3’ end. The PCR was 
performed directly on the circularized product as described”’, resulting in DNA 
with Illumina cluster generation sequences on each end and a sequencing primer 
binding site positioned so that sequencing would start at the 3’ end. DNA was 
purified from a PCR reaction that had not reached saturation and was quantified 
using the Agilent BioAnalyser High Sensitivity DNA assay. DNA was then 
sequenced on the Illumina Genome Analyser 2 according to the manufacturer’s 
instructions, using 4-6 pM template for cluster generation and sequencing primer 
oLSC006 (5'-TCCGACGATCATTGATGGTGCCTACAG). 

Data analysis. Data analysis was performed using scripts written in Python 2.6 
that are available upon request. 

Sequencing analysis. Image data obtained by the Illumina Genome Analyser 2 
was analysed using the GA Pipeline to extract raw sequences. Matrix and phasing 
parameters were estimated from a $X control lane. 

Sequence alignment. Raw sequences 40 bases long were composed of the cDNA of 
the fragmented RNA sequence. For RNA fragments smaller than 40 bases, the 
sequence is followed by part of the 5’ Illumina linker sequence which was removed 
in silico. Alignments to the yeast genome were performed by the alignment pro- 
gram, Bowtie 0.12.0" (http://bowtie-bio.sourceforge.net/). Bowtie settings were 
chosen so that three mismatches were allowed and alignments were required to 
be unique. The shortest sequenced fragments were approximately 18 nucleotides 
due to the RNA size selection step after ligation and random fragmentation. 
Eighteen-base-pair sequences would occur by chance every 6.9 X 10'° bp, which 
is sufficiently rare for 18-bp sequences to be generally uniquely aligned to the 
1.2 X 10” bp yeast genome. Alignments were first performed against tRNA and 
rRNA sequences to remove them. The remaining sequences were aligned against 
a recent version of the yeast genome downloaded from the Saccharomyces Genome 
Database (SGD, http://www.yeastgenome.org/) on 11 October 2009. Statistics on 
sequence alignments are reported in Supplementary Table 1. 

Quantifying antisense and sense transcription levels. At tandem promoters 
sense transcription was determined using available annotated transcription start 
sites*. To allow for the error involved in these transcription start site measurements, 
we calculated the sum of the read density in 500-nucleotide windows for the first 
700 bases after the transcription start site and chose the highest sum. The antisense 
transcription was determined by starting 100 bases upstream of the transcription 
start site and the read density sum in 500-nucleotide-wide windows was calculated 
for the subsequent 1,000 bases. The highest sum was used for downstream analysis. 
Metagene analysis. Each gene included in the analysis is normalized by the mean 
number of reads in a 400-bp window beginning 100 bases downstream from the 
transcription start site. A mean read density (MRD) is then calculated for each 
position over all genes as described below. 
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where rj are the reads for the jth gene at the ith position after the transcription 
start site. 

Extracting pause positions. Pauses were identified in previously annotated tran- 
scription units’ of well-expressed genes. Pauses were defined as having reads 
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higher than three standard deviations above the mean of the surrounding 200 
nucleotides which do not contain pauses. Pauses were required to have at least 
four reads regardless of the gene’s sequencing coverage. Sequence consensus was 
calculated by WebLogo 3 (http://weblogo.threeplusone.com/)*. 

Comparing pause densities to nucleosome positions. Nucleosome positions” 
were assigned as +1, +2, +3 etc according to their position relative to transcrip- 
tion start sites. The mean pause density (MPD) relative to a particular nucleo- 
some was determined by the number of pauses observed at that position (N,) 
divided by the total number of opportunities it could be observed there (N,): 
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where k is the nucleosome number, g(y) is the binary function indicating whether 
a pause occurs at y, and nj are the centre nucleosome positions. For the +2 and 
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+3 nucleosomes, the number of pause opportunities was uniform at every posi- 
tion and was simply the number of genes included in the analysis. The +1 
nucleosome analysis required that the number of pause opportunities at each 
position represent the number of genes where that position occurs after the 
transcription start site. The error of the pause density was calculated via the 
standard deviation of the binomial distribution 


Np (1 —Np Ww.) 
No 


The densities were then binned by averaging across windows ten nucleotides 
wide. The error for each bin was calculated by computing the sum of the variances 
of the binned measurements and calculating the square root. 
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Supermassive black holes do not correlate with 
galaxy disks or pseudobulges 


John Kormendy’, R. Bender®® & M. E. Cornell! 


The masses of supermassive black holes are known to correlate with 
the properties of the bulge components of their host galaxies’ *. In 
contrast, they seem not to correlate with galaxy disks’. Disk-grown 
‘pseudobulges’ are intermediate in properties between bulges and 
disks®; it has been unclear whether they do’” or do not’~’ correlate 
with black holes in the same way that bulges do. At stake in this 
issue are conclusions about which parts of galaxies coevolve with 
black holes”, possibly by being regulated by energy feedback from 
black holes'’. Here we report pseudobulge classifications for 
galaxies with dynamically detected black holes and combine them 
with recent measurements of velocity dispersions in the biggest 
bulgeless galaxies’*. These data confirm that black holes do not 
correlate with disks and show that they correlate little or not at 
all with pseudobulges. We suggest that there are two different 
modes of black-hole feeding. Black holes in bulges grow rapidly 
to high masses when mergers drive gas infall that feeds quasar-like 
events. In contrast, small black holes in bulgeless galaxies and in 
galaxies with pseudobulges grow as low-level Seyfert galaxies. 
Growth of the former is driven by global processes, so the biggest 
black holes coevolve with bulges, but growth of the latter is driven 
locally and stochastically, and they do not coevolve with disks and 
pseudobulges. 

In Fig. 1b, we show the well-known correlation’* between dynam- 
ically measured black-hole masses, M@, and the absolute magnitudes 
of elliptical galaxies (Fig. 1b, black points) and the bulge parts of disk 
galaxies (Fig. 1b, red points). The correlation has y* = 12 per degree of 
freedom, implying moderate intrinsic scatter. This result and a tighter 
correlation’ > between Me and host velocity dispersion, o (Fig. 2), 
motivate the idea that black holes and host bulges evolve together 
and regulate each other’®”’. All new results in this paper stand in 
contrast to these two correlations. 

In Fig. 1a, we plot Me versus the K-band absolute magnitude, Mx, 
of only the disk part of the host galaxy, with the bulge luminosity 
removed. We conclude that black holes do not correlate with galaxy 
disks. This confirms an earlier conclusion’ based on the more indirect 
observation that black holes do not correlate with total (bulge plus 
disk) luminosities of disk galaxies. In Fig. 1a, the colour of each point— 
which indicates bulge type (see below)—is irrelevant. A least-squares 
fit to the red and blue points has correlation coefficient r = 0.41. Errors 
at 1s.d. imply 7” = 81 per degree of freedom: the data do not respect 
the weak anticorrelation. The green points for pure-disk (that is, com- 
pletely bulgeless) galaxies further confirm the large scatter and lack of 
correlation. 

In Fig. 1c, we plot Me versus Mx for pseudobulge components with 
disk light removed. They are also included in Fig. 1b (light blue) to 
show how they compare with classical bulges and elliptical galaxies. 
Pseudobulges required explanation, as follows. 

Much work over several decades has shown that the high-stellar- 
density central components in disk galaxies—all of which used to be 
called ‘bulges’-—come in two varieties. How to distinguish them is 
discussed in Supplementary Information. The difference was first 


found observationally but is now understood to be a result of fun- 
damentally different formation mechanisms’. 

Classical bulges (red points in Figs 1 and 2) are indistinguishable 
from ellipticals in their structure, velocity distributions and parameters. 
Our well-developed picture is that they formed by galaxy mergers’ in 
our hierarchically clustering universe. Mergers are discrete events that 
are separated by long “dead times’. They occur on short timescales, 
approximately equal to the crossing time of the merging galaxies. 
Gravitational torques scramble disks into ellipticals'* and dump large 
quantities of gas into the centre. Observations'* and theory’ suggest 
that the process feeds both starbursts and black holes and causes the 
latter to grow rapidly in quasar-like events. 

Pseudobulges® (blue points in Figs 1 and 2) are observed to be more 
disk-like than classical bulges. They are believed to form more gently by 
the gradual internal redistribution of angular momentum in quiescent 
galaxy disks. The driving agents are non-axisymmetries such as bars. 
One result is the gradual build-up of a high-density central component 
that can be recognized (Supplementary Information) because it remains 
disk-like. These components are called ‘pseudobulges’ to emphasize 
their different formation mechanism without forgetting that they look 
superficially like—and are commonly confused with—classical bulges. 
The difference from bulges that is most relevant here is that the gradual 
gas infall that builds pseudobulges may concurrently provide less black- 
hole feeding and thus may drive slower black-hole growth. One purpose 
of this paper is to contrast black hole/bulge and black hole/pseudobulge 
correlations to look for clues about black-hole growth mechanisms and 
the consequent coevolution (or not) of black holes with host galaxies. 

With this context, we can interpret Fig. 1. Galaxies that contain 
classical bulges are consistent with the correlations for elliptical galaxies 
except for one discrepant object (the bulge-dominated SO galaxy NGC 
4342). The implication is that classical bulges and ellipticals coevolve 
with black holes in the same way. For that coevolution, it is irrelevant 
that bulges are now surrounded by disks whereas ellipticals are not. 

Wereach a different conclusion for pseudobulges on the basis of new 
classifications and measurements of pseudobulge-to-total luminosity 
ratios for all disk galaxies in our sample* that have dynamical black-hole 
detections (J.K., manuscript in preparation; see Supplementary 
Information for a list). A conservative interpretation of Fig. 1b is that 
pseudobulges are roughly consistent with the correlation for classical 
bulges and ellipticals but have much more scatter. In particular, some 
deviate from the correlation for ellipticals in having smaller black holes. 
This was not seen in some previous work'” because samples were small 
and because many pseudobulges in black-hole galaxies had not been 
classified. But, as published samples have grown larger, the hints have 
grown stronger that pseudobulges do not correlate with black holes in 
the same way as classical bulges’ ’. We confirm these hints. Particularly 
compelling is the fact that our galaxies and a new sample of black-hole 
detections based on water masers” have no overlap and independently 
lead to the same conclusion. 

Figure 1c shows the pseudobulges without guidance from the red 
and black points. The sample is small, but we have enough dynamic 
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Figure 1 | Correlations of dynamically measured black-hole masses with the 
luminosities of different parts of their host galaxies. Here Mx is the K-band 
(2.2-um) absolute magnitude of the disk component with bulge light removed 
(a), of the bulge with disk light removed (b) and of the pseudobulge with disk 
light removed (c). All plotted data are published elsewhere; parameters and 
sources are discussed in Supplementary Information, and those for disk galaxies 
are tabulated there. Elliptical galaxies are plotted in black, classical bulges are 
plotted in red and pseudobulges are plotted in dark blue. One galaxy with a 
dominant pseudobulge but with a possible small classical bulge (NGC 2787) is 
plotted with a blue symbol that has a red centre. In least-squares fits, it is 
included with the pseudobulges. Error bars, 1 s.d. In b, the red and black points 
show a good correlation between Me and bulge luminosity: a symmetric, least- 
squares fit‘ of a straight line has y* = 12.1 per degree of freedom and a Pearson 
correlation coefficient of r= —0.82. (All 7° values quoted in this paper are per 
degree of freedom.) In contrast, in a the red and blue points together confirm a 
previous result! that black holes do not correlate with disks: 77 = 81 and 

r = 0.41. Green points are for galaxies that contain neither a classical bulge nora 
pseudobulge but only a nuclear star cluster, that is, pure-disk galaxies. They are 
not included in the above fit, but they strengthen our conclusion. Similarly, in 
c the blue points for pseudobulges show no correlation: 7” = 63 and r = 0.27. In 
all panels, galaxies that have only M@ limits are plotted with open symbols; they 
were chosen to increase our dynamic range. They too support our conclusions. 
This figure uses K-band magnitudes to minimize effects of star formation and 
internal absorption, but in Supplementary Information we show that Fig. 1 
looks essentially the same for V-band (0.55-lym) magnitudes. 
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Figure 2 | Correlation of dynamically measured black-hole masses with the 
velocity dispersions of their host galaxies. Black points are for elliptical 
galaxies, red points are for classical bulges, blue points are for pseudobulges and 
the green point is for a nuclear star cluster. Data sources are given in 
Supplementary Information. Error bars, 1 s.d. The red and black points show 
the well known Me-c correlation”: 7° = 5.0 per degree of freedom and 

r = 0.89. Reducing 7? to 1.0 implies that the intrinsic scatter in log(M@/M<) at 
fixed o is 0.26, consistent with previous derivations*”. This is the tightest 
correlation between black holes and host galaxy properties and the one that 
most motivates the idea that black holes and bulges coevolve. In contrast, the 
blue points for pseudobulges show no correlation: y* = 10.4 and r = —0.08. 
This extends suggestions’ that the M@-o relation for pseudobulges is 
different from that for classical bulges and elliptical galaxies. 


range to conclude that we see no correlation at all. The cumulative 
amount of black-hole growth is not extremely different in classical 
bulges and pseudobulges, but there is no sign in the correlations that 
black-hole feeding has affected the pseudobulges. 

The second and more compelling black hole/host galaxy correlation 
is the one between Me and the velocity dispersion, o, of the stars at 
radii where they do not feel the black hole gravitationally**. Here a is 
averaged inside the ‘effective radius’, r., that contains half of the bulge 
light. Figure 2 shows this correlation. 

As is well known, ellipticals and classical bulges share the same tight 
correlation. But as in Fig. 1, pseudobulges at best show a much larger 
scatter (Fig. 2a). Without the guidance of the red and black points 
(Fig. 2b), they show essentially no correlation. Larger samples that 
reach smaller values of M@ may show a weak relationship'**°. But 
we conclude that classical bulges and pseudobulges show very different 
correlations with M@. Those for classical bulges are tight enough to 
suggest coevolution. Whether pseudobulges correlate with Me with 
large scatter or not at all, the weakness of any correlation (r = —0.08 
here) makes no compelling case that pseudobulges and black holes 
coevolve, beyond the obvious expectation that it is easier for bigger 
black holes and bigger pseudobulges to grow in bigger galaxies that 
contain more fuel. 

From the point of view of galaxy formation by hierarchical cluster- 
ing, pseudobulge galaxies are already pure-disk galaxies'’*. Even more 
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extreme galaxies that contain neither a classical bulge nor a pseudo- 
bulge can, in some cases, also contain black holes. Some have active 
galactic nuclei (AGNs), although these are rare in bulgeless galaxies”. 
And black holes with Me = 10°Ms5-10°M.« (Mo, solar mass) have 
been discovered with confidence in bulgeless (and pseudobulgeless) 
galaxies'**". The most extreme example is NGC 4395, an Sm galaxy 
that contains only a tiny, globular-cluster-like nucleus but that has an 
AGN powered by a black hole” with Me = (3.6 £1.1) X 10°M well 
measured by reverberation mapping”. It is the green point with the 
small error bar in Fig. 2. Other such objects include the Sd galaxy NGC 
3621” and the spheroidal galaxy POX 52, which contains an AGN 
powered by a black hole of mass Me ~ 10°Mq (refs 18, 25). Also, it is 
likely that low-M@ AGN samples include bulgeless galaxies'*”, 
although some are distant and not well resolved. 

The lack of correlation of black holes with disks and pseudobulges 
plus the discovery of black holes in bulgeless galaxies together are 
critical clues to black-hole feeding mechanisms. They motivate the 
following hypothesis: we suggest that there are two fundamentally 
different feeding mechanisms for black holes. 

The traditional mechanism is rapid feeding during major mergers, 
when large amounts of gas fall into galaxy centres. We follow sugges- 
tions’*"* that, at some, perhaps late, stage in the merger, the black hole 
grows quickly in a quasar-like event. These are the growth episodes 
that dominate the waste mass budget of quasar activity’*”* that is well 
explained by the observed density of the biggest black holes in the 
universe’’. In this mode, black-hole and galaxy growth are controlled 
by the same global processes, and these, we suggest, result in black 
hole/bulge coevolution. If black holes and galaxy formation ever regu- 
late each other"’, this is the likely setting in which it happens. 

In contrast, the nuclear activity in bulgeless galaxies is, by and large, 
weak’', Big pseudobulges (NGC 1068 is the highest-luminosity one in 
Fig. 1) can host classical Seyfert nuclei, but there is no sign that these 
affect global galaxy structure. NGC 1068 is a prototypical oval galaxy® 
with a prominent pseudobulge that, we believe, formed slowly by 
inward gas transport in spite of episodic nuclear activity. We suggest 
that the second mode of black-hole growth is such weak nuclear activity 
driven stochastically by local processes that feed gas from ~10* pc, 
where it make a pseudobulge, in to the black hole. The processes are 
not understood in detail. But exactly this kind of feeding mode is pro- 
posed and modelled in ref. 26; the models suggest that this kind of 
feeding does not affect galaxy formation. Differences in black-hole 
growth and coevolution with host galaxies have also been proposed in 
studies of the lowest-Me black holes detected mainly through AGN 
activity'*'*. The same picture can also be reached by studying AGN 
demographics”*. Other recent papers” further explore local processes 
of black-hole feeding. Figures 1 and 2 tell us that this mode involves little 
or no coevolution of black holes with any component of the host galaxy. 

The seed black holes that grew into today’s supermassive black holes 
are not securely identified. But the smallest black holes that are grown by 
means of local feeding plausibly remain most like those seeds. Because 
mergers of disk galaxies are believed to make bulges and ellipticals, we 
suggest that the small black holes that are grown by local processes are the 
seeds of the generally larger black holes that are grown in part by mergers. 
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Supermassive black holes do not correlate with dark 


matter haloes of galaxies 


John Kormendy!?? & Ralf Bender? 


Supermassive black holes have been detected in all galaxies that 
contain bulge components when the galaxies observed were close 
enough that the searches were feasible. Together with the obser- 
vation that bigger black holes live in bigger bulges’, this has led to 
the belief that black-hole growth and bulge formation regulate each 
other®. That is, black holes and bulges coevolve. Therefore, 
reports®’ of a similar correlation between black holes and the dark 
matter haloes in which visible galaxies are embedded have pro- 
found implications. Dark matter is likely to be non-baryonic, so 
these reports suggest that unknown, exotic physics controls black- 
hole growth. Here we show, in part on the basis of recent measure- 
ments® of bulgeless galaxies, that there is almost no correlation 
between dark matter and parameters that measure black holes 
unless the galaxy also contains a bulge. We conclude that black 
holes do not correlate directly with dark matter. They do not cor- 
relate with galaxy disks, either®’®. Therefore, black holes coevolve 
only with bulges. This simplifies the puzzle of their coevolution by 
focusing attention on purely baryonic processes in the galaxy 
mergers that make bulges’’. 

The idea of coevolution was motivated by the observation that 
bigger black holes live in bulges and elliptical galaxies that have bigger 
velocity dispersions, o, at large radii where stars mainly feel each 
others’ gravity and not that of the black hole**. This correlation was 
compelling because its scatter was small, consistent with measurement 
errors. The reduced y value was 0.79 for the highest-accuracy sample’, 
implying that the intrinsic scatter in black-hole mass, Me, at fixed a is 
0.15 dex (ref. 4). The scatter was so small that o could be used as a 
surrogate for M@ for many arguments. More important was the 
implication that a fundamental physical connection between black- 
hole and bulge growth awaits discovery, especially given the realiza- 
tion’’ that even a tiny fraction of the energy produced in black-hole 
growth could, if absorbed by protogalactic gas, regulate bulge forma- 
tion. Small scatter will be important here, too. Tight correlations 
motivate a search for underlying physics. Loose correlations are less 
compelling: bigger galaxies just tend to be made of bigger galaxy parts. 

The discovery®”’ of a similarly tight correlation between o and the 
circular rotation velocities, V.i,., of gas in the outer parts of galaxies, 
where gravity is controlled by dark matter, therefore was taken to 
imply that dark matter also regulates black-hole growth. In fact, it 
was suggested that the more fundamental correlation is the one with 
dark matter; in other words, that dark matter engineers coevolution. 

The proposed black hole/dark matter correlation raised two con- 
cerns. First, it was known that black holes do not correlate with galaxy 
disks’, whereas galaxy disks correlate closely with dark matter'?*. It 
was therefore not clear how black holes and disks could separately 
correlate with dark matter without also correlating with each other. 
Second, the velocity resolution of some o measurements was too low to 
resolve narrow spectral lines; this problem is discussed in the legend of 
Fig. 1. 

If dark matter controls black-hole growth and bulges are essentially 
irrelevant, then V.;,. should correlate tightly with o even in galaxies 


that do not have bulges. Figure 1 tests this hypothesis. It also updates 
the plot that was used to claim’ a black hole/dark matter correlation. 
The reliable original data are shown in black; points measured with low 
velocity resolution were omitted as documented in Supplementary 
Table 1. Motivated by the above discussion, we measured® velocity 
dispersions in six Sc-Scd galaxies that have nuclear star clusters 
(‘nuclei’) but essentially no bulges. They are shown by red points in 
Fig. 1. Other coloured points show additional published data on bulge- 
less galaxies that were measured with enough spectral dispersion to 
resolve nuclear o. 

Bulgeless galaxies (Fig. 1, coloured points plus NGC 3198) show only a 
weak correlation between Viz. and o. This is expected, because bigger 
galaxies tend to have bigger nuclei’. However, the lack of a tight correla- 
tion suggests no more compelling formation physics than the expectation 
that bigger nuclei can be manufactured in bigger galaxies that contain 
more fuel. The scatter is much larger than the measurement errors: 
7° = 15.7. We note in particular that galaxies with o ~ 25kms ' span 
almost the complete V.i,- range for the coloured points, 96-210 km s — " 

Our measurements* were made with the 9.2-m Hobby-Eberly 
Telescope and High Resolution Spectrograph; the instrumental velocity 
dispersion of Ging = 8kms*' reliably resolves the smallest velocity dis- 
persions seen in galactic nuclei. We confirm that ¢ = 19.8+0.7kms | 
in galaxy M33 and include this value in Fig. 1. 

Two properties of our sample deserve emphasis. First, we observed 
NGC 5457 (also known as M101) and NGC 6946 because these are 
among the biggest bulgeless galaxies (Vere ~210km s '). This is 
important because it was concluded in ref. 6 that (using our notation) 
“the Veire-o relation...seems to break down below Veire~ 150kms ””, 
implying that “halos of mass smaller than ~5 X 10''Mj are increas- 
ingly less efficient at forming [supermassive black holes]”. Our galaxy 
measurements show that the V.,.-o relation breaks down even at 
Vere = 210kms | if the galaxy contains no bulge. 

Second, our sample is intentionally biased against galaxies that con- 
tain bulges. We even avoided substantial ‘pseudobulges’, that is, ‘fake 
bulges’ made by the internal evolution of galaxy disks’® rather than by 
the galaxy mergers that make ‘classical bulges’. The pseudobulge-to- 
total mass ratios of our galaxies are a few per cent or less; the bulge-to- 
total mass ratios are zero. The relevance of pseudobulges is discussed 
below. We chose these galaxies because, as noted earlier, we want to 
know whether dark matter correlates with black holes in the absence of 
the component that we know correlates with black holes. A study’’ of a 
large galaxy sample that is not biased against bulges results in similar 
conclusions: V.;,- correlates weakly with o, especially for galaxies of 
Hubble types that commonly contain classical bulges, but the scatter is 
large and “these results render questionable any attempt to supplant 
the bulge with the halo as the fundamental determinant of the central 
black hole mass in galaxies”’’. Results from this study are included in 
Supplementary Figure 3. 

Figure 1 shows substantial overlap in V.;,. between the coloured points 
that show little correlation with o and the black filled circles that show 
good correlation. In the overlap range, 180 km s~ "S Vere S 220kms™', 
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Figure 1 | Outer rotation velocities versus near-central velocity dispersions 
of disk galaxies. Data are listed in Supplementary Information. Error bars, 
1s.d. The original black hole/dark matter correlation® is shown using black 
symbols (circled if the galaxy has a classical bulge) except that points have been 
omitted if the o measurement had insufficient velocity resolution. For example, 
the bulgeless Scd galaxy IC 342 (now the orange point, after correction) was 
previously plotted’ at ¢ = 77 + 12kms_', consistent with the black points. But 
the measurement” had low resolution: the instrumental velocity dispersion, 
instr = (resolution full-width at half-maximum)/2.35, was 61 kms’, similar 
to the value of o measured for IC 342. Low resolution often results in 
overestimation of o. The same source lists ¢ = 77 kms! for the nucleus of 
M33, which has og = 21 +2kms | as measured at high resolution”*””. In fact, a 
high-resolution measurement of IC 342 was available”*: at Ging: = 5.5kms_',o 
is observed to be 33 + 3kms * (orange point). We replaced four filled circles 
for which better o measurements are available; three of these are now coloured 
points, and the fourth is NGC 3198. We omitted five plus-shaped points for 
which ¢ S Ginstr. And we added points (colour) for galaxies measured with 
instr < 10 kms‘, that is, high enough resolution to allow measurement of the 
smallest velocity dispersions seen in galactic nuclei. The line (equation at 
bottom; velocities are in units of 200 kms!) isa symmetric least-squares fit’ to 
the black filled circles minus NGC 3198. It has 7” = 0.25. The correlation 
coefficient is r = 0.95. This correlation is at least as good as the one between Me 
and o. The correlation for the plus-shaped points has y* = 2.6 and r = 0.77. For 
these galaxies, we have only optical rotation curves, which measure V.i;- less 
accurately than do HI measurements because they do not reach as far out into 
the dark matter halo’. They show a weaker correlation that is not a compelling 
argument for coevolution. Weaker still is the correlation for the coloured points 
plus NGC 3198: it has y* = 15.7 and r= 0.70. 


galaxies satisfy the tight V.i,—o relation shown by the black filled 
circles only if they contain bulges. We conclude that baryons must 
matter to black-hole growth. But baryons in a disk are not enough to 
allow us to predict M@. Dark matter by itself is not enough. The galaxy 
M101 (Fig. 1, top-left red point) has a halo that is similar to those of half 
of the tightly correlated galaxies, but that halo did not manufacture a 
canonical black hole in the absence ofa bulge. This suggests that bulges, 
not haloes, coevolve with black holes. 

Nevertheless, most of the black circles in Fig. 1 show a correlation 
whose scatter is consistent with the error bars. We need to understand 
this. 

We suggest that the tight correlation of black points in Fig. 1 is a 
result of the well-known ‘conspiracy’*"* between baryons and dark 
matter to make featureless rotation curves with no distinction between 
the parts that are dominated by baryonic and non-baryonic matter. 
This possibility was considered and dismissed in ref. 6. However, it is a 
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natural consequence of the observation that baryons make up 17% of 
the matter in galaxies’*® and that, to make stars, they need to dissipate 
inside their haloes until they are self-gravitating. This is sufficient to 
engineer that V.;. be approximately the same for dark matter haloes 
and for disks embedded in them’’”®. That part of the conspiracy is not 
shown by Fig. 1 because, absent a bulge, disks reach V.i,, at large radii 
that are not sampled by o measurements of nuclei. 

Bulges dissipate more than disks. The consequences are shown in 
Supplementary Figs 1 and 2. Supplementary Fig. 1 shows that Vi; for 
the bulge approximately equals V i, for the halo in the two highest- 
V ire galaxies whose points are circled in Fig. 1. Supplementary Fig. 2 
shows that the same approximate equality holds, given the uncertainties 
in rotation-curve decomposition, for all decompositions that we could 
find that included a bulge. It holds in just the V.i,, range, 180-260 km 
s_', where the black circles in Fig. 1 show tight correlation. Because a 
bulge has Veire © \/2o, a correlation such as that in Fig. 1 is expected 
from Supplementary Fig. 2. All galaxies that participate in the tight 
correlation in Fig. 1 are included in Supplementary Fig. 2, and all of 
them have bulges or pseudobulges. We conclude that the correlation is 
nothing more nor less than a restatement of the rotation-curve con- 
spiracy for bulges and dark matter. It is a consequence of dark-matter- 
mediated galaxy formation. The conceptual leap to a direct causal cor- 
relation between dark matter and black holes is not required by the data. 

So far, we have discussed black-hole correlations indirectly using the 
assumption that o is a surrogate for black-hole mass. We now check 
this assumption and show that it is not valid for most of the black 
points that define the tight correlation in Fig. 1. If o is not a measure of 
Me for these galaxies, then this further shows that the correlation is 
not a consequence of black hole/dark matter coevolution. 

In Fig. 2, we examine directly the correlations between Me and host 
galaxy properties for galaxies in which black holes have been detected 
dynamically. All plotted parameters are published elsewhere. The galaxy 
sample and plotted data are listed in Supplementary Information of the 
accompanying Letter'®. The same galaxies are shown in all panels except 
that ellipticals do not appear in Fig. 2c because they do not have disks; 
bulgeless galaxies do not appear in Fig. 2a because they do not have 
bulges; and some bulgeless galaxies and pseudobulge galaxies with Me 
limits do not appear in Fig. 2b because c is outside the range of the plot. 

The top panels correlate M@ with the luminosity (Fig. 2a) and 
velocity dispersion (Fig. 2b) of the host galaxy bulge. Ellipticals (black) 
and classical bulges (red) show the good (Fig. 2a) and better (Fig. 2b) 
correlations that we have come to expect. 

Figure 2c shows’? that galaxy disks do not correlate with Me. Disk 
masses, which are approximately proportional to their K-band 
luminosities, cannot be used to predict Me. 

Figure 2 also distinguishes classical bulges (red points) from pseudo- 
bulges (blue points). Classical bulges are essentially indistinguishable in 
structure and parameter correlations from elliptical galaxies (black 
points). We believe that both formed by galaxy mergers (see below). 
Pseudobulges are high-density, central components in galaxies that 
superficially resemble—and often are mistaken for—classical bulges 
but that can be recognized because their properties are more disk-like 
than those of classical bulges. We now know that this results from fun- 
damentally different formation histories. Complementary to hierarchical 
clustering”’, a new aspect of our understanding of galaxy formation"® is 
that isolated galaxy disks evolve slowly as non-axisymmetries such as 
bars redistribute angular momentum. During this process, pseudo- 
bulges are grown out of disk material. Bulge—pseudobulge classifica- 
tions are listed for all objects in our sample in Supplementary 
Information of ref. 10. Figure 2a, b illustrates a conclusion from that 
Letter which has consequences here: pseudobulges show essentially no 
correlation between Me and o. Baryons do not predict Me if they are 
in a pseudobulge. 

If Me and o do not correlate for pseudobulges, then o is not a 
surrogate for Mg in Fig. 1, either. Bulge classifications for the galaxies 
shown in Fig. 1 are given in Supplementary Information, and two 
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Figure 2 | Correlations of dynamically measured black-hole masses with 
structural parameters of host galaxies. a, Black-hole mass, Me, versus the 
K-band absolute magnitude of the host galaxy bulge, Mx bulge with disk light 
removed. b, Me versus the velocity dispersion, o, of the host bulge averaged 
inside the radius that contains one-half of the bulge light. Elliptical galaxies are 
plotted in black, classical bulges are plotted in red and pseudobulges are plotted 
in blue. The blue symbol with the red centre represents the galaxy NGC 2787, 
which has a dominant pseudobulge and possibly also a small classical bulge. In 
least-squares fits, it is included with the pseudobulges. c, d, Analogous 
correlations for the disks of host galaxies: M@ versus the K-band absolute 
magnitude of the disk, Mx disk, with bulge light removed (c); Me versus outer 
rotation velocity, Vir: (d). Green points are for galaxies that contain neither a 
classical bulge nor a large pseudobulge but only a nuclear star cluster. Closely 
similar versions of a-c appear in the accompanying Letter’®; all data plotted 
here are tabulated there in Supplementary Information. Errors bars, 1 s.d. On 
the basis of these, classical bulges and ellipticals in a together have y” = 12.1 and 
r = —0.82; this is the well-known good correlation and is consistent with 
previous derivations’. We assume 1- 3s d. errors in Mx of +0.05 mag. Similarly, 
in b, the red and black points have y* = 5.0 and r = 0.89, implying an intrinsic 
scatter in log)Me@/M.«) of 0.26 at fixed o to reduce va to 1.0. This also is 
consistent with previous derivations”’”’. It is the good correlation that 
motivates our ideas about black hole/bulge coevolution. In contrast, 
pseudobulges do not correlate with M@; in a, the blue points have va = 63 and 
r = 0.27, and in b, they have a = 10.4and r= —0.08. Similarly’®, disks do not 
correlate with Me: in ¢, the red and blue points together have ¢ = 81 and 
r= 0.41. Formally, Me and disk luminosity anticorrelate, but this is not 
significant. Finally, in d, the blue points have va = ll andr = 0.29, showing no 
correlation. In contrast, the red points, for classical bulges, show a correlation 
that, we argue, is a restatement of the rotation-curve conspiracy. Here elliptical 
galaxies and some SO galaxies are plotted in light grey and pink, respectively, to 
indicate that V.i,. is not known. We therefore plot the surrogate quantity 20, 
and it agrees with the correlation seen for bulges that have V.;,, measurements. 
dis a direct demonstration that, absent bulges, black holes do not correlate with 
dark matter. In all panels, open symbols represent galaxies for which we have 
only upper limits on M @; they are not included in the above statistics, but they 
further strengthen our conclusions. 


galaxies with no published data are classified. We find that only four of 
the black points (circled) in Fig. 1 represent classical bulges; these are 
M31, NGC 2841, NGC 4258, and NGC 7331. The others represent 
pseudobulges. For these, the demonstration of a tight V.;,.-o correla- 
tion is not a demonstration that dark matter and black holes correlate. 

Instead, if V.;,. correlates with o but o does not measure Me, then 
our conclusion that the correlation results from the rotation-curve 
conspiracy gains further support. Also, the circled points for classical 
bulges agree with the correlation for pseudobulges. It is implausible 
that the correlation for the circled points is caused by black hole/dark 
matter coevolution whereas the identical correlation for the other 
points has nothing to do with black holes. 
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Finally, Fig. 2d shows directly that M@ does not correlate with Vii, 
and, therefore, with dark matter for pseudobulges. 

An additional argument is given in Supplementary Information, 
section 3. If dark matter V.;,. predicts M@ independently of baryon 
content, then the dark matter with V.;,.~ 1,500 kms? in clusters of 
galaxies predicts black holes of mass Me ~7 X 10''M.~ that are 
impossible to hide in well-studied clusters such as Coma. 

Therefore, over the whole range of V.;,. values associated with dark 
matter, that is, at least 50-2,000 kms /, Fig. 2d shows a correlation 
with Me only for the range 200-400 kms’, plus NGC 7457 at 105 km 
s ', and only if the galaxy contains a classical bulge. The bulge cor- 
relation can be understood as an indirect result of the rotation-curve 
conspiracy. Even for Voir in the range 200-400 kms ', there is no 
correlation if the galaxy has only a pseudobulge or disk. Baryons are 
not irrelevant. They are not even sufficient. To correlate with M @, they 
must be in a classical bulge or elliptical. 

We conclude that black holes do not correlate causally with dark 
matter haloes. There is no reason to expect that the unknown, exotic 
physics of non-baryonic dark matter directly affects black-hole 
growth. Even dark matter gravity is not directly responsible for black 
hole/galaxy coevolution. Rather, that coevolution seems to be as simple 
as it could be. Black holes coevolve only with classical bulges and 
ellipticals. We have a well-developed picture of the their formation. 
Hierarchical clustering of density fluctuations in cold dark matter 
results in frequent galaxy mergers'!”’”?. The products of roughly 
equal-mass mergers are classical bulges and ellipticals, because pro- 
genitor disks get scrambled away by dynamical violence’’. During this 
process, gas falls to the centre, triggers a burst of star formation**** and 
builds the high stellar densities that we see in bulges. This gas may also 
feed black holes. In fact, we see a correspondence”’ between mergers in 
progress and quasar-like nuclear activity that increases Me. Our 
increasingly persuasive picture is that the growth of black holes and 
the assembly of ellipticals happen together and regulate each other’?”*. 
The present results support this picture. 
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Directed self-assembly of a colloidal kagome lattice 


Qian Chen’, Sung Chul Bae’ & Steve Granick’?" 


A challenging goal in materials chemistry and physics is sponta- 
neously to form intended superstructures from designed building 
blocks. In fields such as crystal engineering’ and the design of porous 
materials*“*, this typically involves building blocks of organic mol- 
ecules, sometimes operating together with metallic ions or clusters. 
The translation of such ideas to nanoparticles and colloidal-sized 
building blocks would potentially open doors to new materials and 
new properties” ’, but the pathways to achieve this goal are still 
undetermined. Here we show how colloidal spheres can be induced 
to self-assemble into a complex predetermined colloidal crystal—in 
this case a colloidal kagome lattice* '"— through decoration of their 
surfaces with a simple pattern of hydrophobic domains. The build- 
ing blocks are simple micrometre-sized spheres with interactions 
(electrostatic repulsion in the middle, hydrophobic attraction at 
the poles, which we call ‘triblock Janus’) that are also simple, but 
the self-assembly of the spheres into an open kagome structure con- 
trasts with previously known close-packed periodic arrangements of 
spheres'*"**. This open network is of interest for several theoretical 
reasons’ '°. With a view to possible enhanced functionality, the 
resulting lattice structure possesses two families of pores, one that 
is hydrophobic on the rims of the pores and another that is hydro- 
philic. This strategy of ‘convergent’ self-assembly from easily fabri- 
cated”* colloidal building blocks encodes the target supracolloidal 
architecture, not in localized attractive spots but instead in large 
redundantly attractive regions, and can be extended to form other 
supracolloidal networks. 

Colloidal crystals are important for their proposed applications in 
photonics, biomaterials, catalytic supports and lightweight structural 
materials. They also serve as model systems in which to study the phase 
behaviour and crystallization kinetics of atomic and molecular crys- 
tals'?'°. Usually composed of hard spheres that are homogeneous 
in surface functionality, their spontaneous formation is mostly 
induced by the minimization of entropy, which results in a limited 
selection of attainable close-packed crystal types'**. More complex 
crystals assembled from similarly homogeneous spheres have been 
constructed in binary colloidal’’ and template-assisted systems'*. To 
achieve programmable formation of crystals, building blocks with 
designed specific surface functionalities such as DNA linkers’’”? and 
attractive ‘patches’? have been proposed, but these approaches 
pose synthetic challenges and can be difficult to generalize. For 
example, the kagome lattice (see Fig. 1), which is of theoretical interest 
for mathematical reasons® as well as its relevance to mechanical 
stability of an isostatic lattice’ and frustration in magnetic materials”, 
is composed of interlaced triangles whose vertices have four contacting 
neighbours. To construct it by direct assembly would require colloids 
with four unevenly distributed patches on their equators to line up 
precisely with their counterparts on neighbouring spheres (see 
Supplementary Fig. 1a) but methods to obtain the desired colloids 
are not immediately accessible. 

Accordingly, we chose the kagome lattice as our target colloidal 
crystal, and produce it using the following alternative strategy. To 
reduce the need to start with a specific pattern of attractive spots on 
each building block, we designed a building block with the orthogonal 


attributes of minimal surface design combined with self-adjusted coor- 
dination number. This simplifies the original four-patch decoration 
scheme into one with two patches at opposite poles, each of which 
subtends an angle in the plane large enough to allow coordination with 
two nearest neighbours (see Supplementary Fig. 1b). This has the 
advantage that established synthetic methods”* can be used to decorate 
spherical particles with two hydrophobic poles of tunable area, sepa- 
rated by an electrically charged middle band. Because each of the 
hemispheres is chemically ‘Janus’ (two-sided)” with the same middle 
band, we refer to these as ‘triblock Janus’. 

This motif causes neighbouring particles to attract at their poles in a 
geometrical arrangement limited by their size, while avoiding energetically 
unfavourable contacts between the charged middle bands. After over- 
night sedimentation, the density mismatch between our gold-plated 
polystyrene particles and the water in which they are suspended con- 
centrates the particles into a quasi-two-dimensional system. Our syn- 
thetic scheme produces elongated caps (see Supplementary Fig. 2), which 
further facilitates assembly into two-dimensional networks because it 
allows two nearest neighbours only when the long patch axes of neigh- 
bouring particles are parallel. Ordering is then switched on at will by 
adding salt (3.5 mM NaCl in these experiments) to these spheres in 
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Figure 1 | Colloidal kagome lattice after equilibration. a, Triblock Janus 
spheres hydrophobic on the poles (black, with an opening angle of 65°) and 
charged in the equator section (white), are allowed to sediment in deionized 
water. Then NaCl is added to screen electrostatic repulsion, allowing self- 
assembly by short-range hydrophobic attraction. b, Fluorescence image of a 
colloidal kagome lattice (main image) and its fast Fourier transform image 
(bottom right). Scale bar is 4 1m. The top panel in c shows an enlarged view of 
the dashed white rectangle in b. Dotted red lines in c highlight two staggered 
triangles. The bottom panel in ¢ shows a schematic illustration of particle 
orientations. 
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deionized water; the salt screens electrostatic repulsions and allows 
hydrophobic attraction to come into play. The successful completion 
of the ordering process requires the energy landscape of the system to be 
smooth enough for kinetically favoured intermediates to transform 
finally to the thermodynamically favoured product with maximized 
hydrophobic contacts. Experimentally, the hydrophobic attraction of 
around 10kgT (where kg is the Boltzmann constant and T is temper- 
ature) per contact” allows self-correction of imperfectly aligned bonds, 
which is advantageous because kinetically formed intermediate defects 
finally transform to the favoured structure over a convenient timescale. 
The scheme of self-assembly is summarized in Fig. 1a. 

Fluorescence images of the final product (Fig. 1b) reveal single- 
phase web-like sheets that tessellate the surface in the known pattern 
of a kagome lattice. Interestingly, the two populations of pores in this 
lattice, triangular and hexagonal, each possess inherently different 
microenvironments on the pore rims. As demonstrated in Fig. 1c, 
hexagonal cavities are surrounded by negatively charged rims and 
triangular cavities are surrounded by hydrophobic rims. Previously, 
others have succeeded in forming kagome lattices when functional 
molecules self-assemble epitaxially onto certain metals'’, but our sub- 
strate is essentially inert; it simply carries negative charge in order to 
prevent colloids from sticking to it. Accordingly, a key difference from 
such molecular systems is that lateral positions of elements of the 
kagome lattice display thermal fluctuation around their mean posi- 
tions, as shown in Supplementary Movie 1. In principle, this should 
enable direct measurement of the vibration modes and phonon struc- 
ture”, which are of interest theoretically’. Although it might seem 
obvious which structure we will observe, computer simulations of 
similar particles show a different structure, close-packed with altern- 
ating attractive bands*’. The key difference appears to be that experi- 
mentally, particles were free to exchange outside the monolayer. This 
contrast between experiment and simulation implies that different 
crystal structures should be observed at low and high pressure. We 
anticipate another phase transition—melting—if the attraction were 
weaker, as could be achieved experimentally by using a mixed mono- 
layer of alkane thiols instead of the strongly hydrophobic monolayers 
used here. 

We now follow the ordering process, which is difficult to do with 
surface-templated systems. Defining the start of the experiment as the 
moment when salt is added, fluorescence imaging shows that particles 
first recognize the existence of neighbours by clustering into kinetically 
favoured triangles, strings or a combination of the two. Subsequently, 
they rapidly coordinate with additional particles to maximize contacts 
between hydrophobic poles, resulting in chunks of network with 
defects consisting mainly of irregular voids. A typical early-stage image 
is shown in Fig. 2a. The sample at this time contains some web-like 
structures (Fig. 2b) and other strings with dangling bonds (Fig. 2c). 
Quantitative analysis of their relative abundance (Fig. 2d) shows, with 
elapsed time, a monotonic decrease of monomers, a monotonic 
increase of web-like structures, and a peak in the abundance of strings; 
hence, strings are a kinetically favoured intermediate that does not 
require patch alignment as accurate as in webs and that has more 
translational and rotational freedom. In Supplementary Movie 2, we 
follow the typical evolution of a string first into triangular nodes with 
branches, then into enclosed pores. The particle number density in this 
movie is smaller than in Fig. 2a to demonstrate individual dynamic 
events. This system behaves similarly to the pattern in Fig. 2d at this 
stage of assembly, but at a slower rate. 

For the kagome lattice to develop fully requires much longer. After 
the formation of irregular webs, the colloids first adopt quasi-kagome 
order in local areas, which then extend. This growth of the final struc- 
ture from metastable intermediates suggests that classical nucleation 
theory for the (well-studied) crystallization of closely packed spheres 
should be revisited; in this system, in determining the size and shape of 
critical nuclei, the concept of average surface tension may be quantita- 
tively or even qualitatively different. To quantify the ordering process, 
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Figure 2 | Stages of self-assembly of kagome lattice. a, Illustrative 
fluorescence image taken 35 s after initiation of assembly. Scale bar is 4 um. 
b, c, Enlarged views of a web-like area (b) and a single-particle string area 
(c), both highlighted in a by dashed white rectangles. Bonding types are shown 
in the accompanying schematic illustrations (blue lines in b and red lines in 
c). d, Time evolution of structures illustrated in a-c, showing the relative 
abundance of discrete spheres (black squares), strings (red circles), and 
triangular bonding (blue triangles). The time spans 0.2 to 12 min. 


for each particle in each time sequence of images, we determined and 
then quantified the local coordination of each particle to its neighbours 
using the local bond orientational order parameter W,;. Particles with 
four bonds and w,; larger than 0.7 were defined as locally crystalline". 
Figure 3 consists of a time series of images, showing isolated crystal 
bonding, then fusion into larger domains, and finally healing of non- 
crystalline bonds into ordered ones. The crystalline domains fluctuate, 
making the edges across ordered and random regions rough and some- 
times transient (Supplementary Movie 3). The graph in Fig. 3e charac- 
terizes the average of images of this kind and shows the time dependence. 
Interestingly, the approach follows typical first-order chemical reaction 
kinetics, during which time the particle number density remains nearly 
constant, with the fraction of product increasing rapidly at first, then 
more slowly as the available materials are depleted, and finally saturating 
exponentially to a plateau. Together with the earlier stage in which 
individual particles condense into irregular networks (Fig. 2 and 
Supplementary Movie 2), this is reminiscent of two-step nucleation in 
protein solutions, in which order is preceded by a dense amorphous 
state’. Eventually, we find that growing crystalline grains of different 
orientations impinge on one another. Supplementary Fig. 3 shows 
examples of long-range crystalline order, at dilute and concentrated 
concentrations. This polycrystalline structure could be annealed and 
its order improved by the usual methods of materials science. 
Furthermore, sheets of kagome lattice are found to stack, one above 
the other, in parallel layers. In a bilayer, both layers retain the in-plane 
order of a kagome lattice: in registry but staggered in orientation, so 
that the intersection of nodes of the two lattices forms an octahedron. 
An optical image of the resulting array of octahedra is shown in Fig. 4. 
This stacking maximizes the hydrophobic node comprised of six 
hydrophobic poles in each octahedron; alternative arrangements 
would be more costly in energy, requiring hydrophobic nodes in one 
plane to be positioned over the pores with charged rims in the second 
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Figure 3 | Crystallization of the kagome lattice. Fluorescence images taken at 
2.9h, 3.0h, 3.1h and 53.8 h (a, b, c and d, respectively). Blue circles denote the 
particles of local crystalline order as defined in the text. Blue lines are their 

neighbouring bonds. Scale bar is 4 j1m. e, The time evolution of the fraction of 


plane. Although technical limitations in visualizing the resulting struc- 
ture at present prevent us from visualizing the assembly to still thicker 
films, there is the potential to fabricate multilayers of vertex-sharing 
octahedral, pyrochlore and other hierarchical structures usually asso- 
ciated with inorganic crystals rather than colloids*. Looking ahead to 
possible applications, freezing these structures into place offers the 
potential to form selective membranes, in which some holes are hydro- 
phobic and others hydrophilic. 

These design rules suggest generalizations. Other open structures”® 
could be designed from triblock particles the coordination numbers of 


Figure 4 | A bilayer of parallel kagome lattices. a, Fluorescence image taken 
from the bottom. The eye sees octahedra consisting of staggered triangular 
nodes, as shown in the schematic illustration at bottom left. Scale bar is 4 um. 
b, Enlarged view of the dashed white square in a. The top panel in b shows a 
fluorescence image. The bottom panel in b shows a schematic view of particle 
arrangements from two perspectives, nearly vertical to the plane and nearly 
parallel to it. 
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particles bearing local crystalline order (black squares), corrected for particles 
with missing bonds at the boundary, which approaches a plateau with 
exponential kinetics (red line). Also shown is the particle number density (blue 
circles), normalized by its observed maximum. 


which differ on the opposing north and south poles. The needed 
modulation of the angular range of attraction could be achieved by 
lessening the size of the hydrophobic patch, or alternatively by less 
screening of repulsion. If building blocks were to carry four attractive 
patches distributed at tetrahedral angles, a diamond structure would be 
artificially designed, although implementation of this awaits the 
development of the synthetic methods needed to produce the parent 
particles*’. The common point is that colloidal building blocks, attract- 
ing one another reversibly, during the early stages of assembly 
assemble into intermediate clusters (strings, in the present experi- 
ments) with wide latitude in the mutual orientation of neighbouring 
particles. Subsequently, the orthogonal variable of geometrical shape 
then guides these transiently stable intermediates to the final structure. 


METHODS SUMMARY 


Fluorescent latex particles of sulphate polystyrene (1 jum in diameter, F-8851 from 
Invitrogen) are made hydrophobic on opposite poles through glancing angle 
deposition of titanium (2 nm) and gold (25 nm) thin films, followed by the deposi- 
tion of self-assembled monolayers of n-octadecanethiol (Sigma-Aldrich). After 
adding salt (NaCl) to deionized water to a final concentration of 3.5 mM, their 
self-assembly is observed at room temperature under epifluoresence microscopy. 
Images captured by an iXon electron multiplying charge coupled device 
(EMCCD) camera are analysed manually for the early stage of self-assembly, 
and by single-particle tracking code after particles begin to form web-like struc- 
tures. Because incomplete webs are only locally periodic, we use the local bond 
orientational order parameter, rather than a translational order parameter, to 
quantify the growth of ordered regions within the sample. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Particle fabrication. Patches on opposite poles of spherical particles are produced 
by sequential glancing angle deposition’®. First, a closely packed monolayer of 
1 um fluorescent sulphated latex particles (F-8851, Invitrogen) is fabricated on a 
silicon wafer substrate using a reported method”. In brief, 80 pl water/ethanol 
dispersion (volume ratio 1:1) containing 8 wt% latex particles is dropped onto the 
top surface of a 1cm X 1 cm piece of glass (pretreated by Piranha solution) sur- 
rounded by water located at the midbottom of a Petri dish. The dispersion spreads 
freely on the water surface until it covers nearly the entire area. Then 10 pl of 
sodium dodecyl sulphate (2 wt%) solution is added to the water surface to reduce 
the surface tension and condense the particles into a closely packed monolayer of 
about 16cm” in area. A silicon wafer (1.5cm X 2.5cm, pretreated by Piranha 
solution) is used to pick a piece of floating monolayer of particles, left to dry for 
later treatment. 

Second, glancing angle deposition of 2 nm Ti/25 nm Au layers onto the colloidal 
monolayer is performed as described'®. The glancing angle 0 is set to be 30° to the 
particle monolayer. After the first vapour deposition, the particle monolayer is 
lifted up with a polydimethylsiloxane (PDMS) stamp so that patches from the first 
vapour deposition are facing down. PDMS stamps are prepared by curing the 
monomer and crosslinking agent (10:1 w/w) (Dow Corning) at 70 °C ina pumped 
oven overnight. Just before stamping, the PDMS surface is treated with oxygen 
plasma to induce the necessary adhesion. The oxygen plasma is generated by a 
Harrick PDC-32G plasma cleaner. Low plasma power is used (6.8 W), and the 
chamber pressure is about 150 mTorr. The treatment duration is 45 s. The stamp- 
ing is carried out immediately after the plasma treatment. Then the second deposi- 
tion is performed, from the other direction of the colloidal monolayer, to produce 
patches on the other poles of the colloids. The PDMS stamp with colloidal part- 
icles attached is immersed in 2 mM octadecanethiol (Sigma-Aldrich) in ethanol 
for 7h to render the Au coatings hydrophobic. Particles on PDMS stamp are 
rinsed with ethanol multiple times and then redispersed in deionized water via 
ultrasonication. 


LETTER 


Self-assembly. A suspension of triblock Janus particles in deionized water is 
contained in a flat silica cuvette (Lab-Tek II chambered coverglass). Particles 
are repelled from the cuvette bottom by its negative charge. We note that the 
gravitational height of the as-prepared particles, h = kyT/mg ~ 4 um, concentrates 
the dispersed particles into a quasi-two-dimensional system after overnight sedi- 
mentation. Here, T is the room temperature and mg is the buoyant weight of the 
particle, considering both the density of latex particles (1.055 gcm *) and the gold 
coating. The range of hydrophobic attraction is short compared to our particle 
diameter, which the experimental literature reports to be in the range 10-100 nm 
and was successfully modelled with a potential decaying roughly exponentially 
with a decay constant of the order of 10 nm (ref. 23). This short range can enforce 
contact interactions, disfavouring the more loosely bound structures that will 
result for long-range attractions. The salt NaCl is added in order to screen repul- 
sion to a Debye length of ~5 nm to allow the recognition of attraction between 
patches and their consequent self-assembly. Using epifluorescence microscopy 
(63X air objective with a 1.6% post magnification, numerical aperture 0.75) with 
an iXon EMCCD camera, we monitor the dynamic evolution of the system. We 
quantified this dynamical change in structure by manual mapping in Fig. 2d, and a 
combination of particle tracking and calculation of the local two-dimensional 
bond-orientational order parameter in Fig. 3e: 


nn 


ga LS 
9 nn = 


where nn is the number of nearest neighbours of particle j identified from 
Delaunay triangulation. Here (jx is the angle of the bond between particle j and 
its neighbour k to an arbitrary reference axis. This definition of order parameter is 
valid for the kagome lattice because it shares the same arrangement of neighbour 
orientation as a triangular lattice except for the missing particles. Meanwhile, to 
avoid the inclusion of particles locally ordered with six bonds as in a triangular 
lattice, only particles with both —,; > 0.7 and four nearest-neighbouring bonds are 


denoted as particles with local kagome lattice order. 
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Probing the electromagnetic field of a 15-nanometre 
hotspot by single molecule imaging 


Hu Cang"?, Anna Labno**, Changgui Lu’, Xiaobo Yin', Ming Liu’, Christopher Gladden”, Yongmin Liu? & Xiang Zhang"? 


When light illuminates a rough metallic surface, hotspots can 
appear, where the light is concentrated on the nanometre scale, 
producing an intense electromagnetic field. This phenomenon, 
called the surface enhancement effect'?, has a broad range of 
potential applications, such as the detection of weak chemical sig- 
nals. Hotspots are believed to be associated with localized electro- 
magnetic modes**, caused by the randomness of the surface 
texture. Probing the electromagnetic field of the hotspots would 
offer much insight towards uncovering the mechanism generating 
the enhancement; however, it requires a spatial resolution of 
1-2 nm, which has been a long-standing challenge in optics. The 
resolution of an optical microscope is limited to about half the 
wavelength of the incident light, approximately 200-300 nm. 
Although current state-of-the-art techniques, including near-field 
scanning optical microscopy’, electron energy-loss spectroscopy’*, 
cathode luminescence imaging’ and two-photon photoemission 
imaging® have subwavelength resolution, they either introduce a 
non-negligible amount of perturbation, complicating interpreta- 
tion of the data, or operate only in a vacuum. Asa result, after more 
than 30 years since the discovery of the surface enhancement 
effect?'', how the local field is distributed remains unknown. 
Here we present a technique that uses Brownian motion of single 
molecules to probe the local field. It enables two-dimensional 
imaging of the fluorescence enhancement profile of single hotspots 
on the surfaces of aluminium thin films and silver nanoparticle 
clusters, with accuracy down to 1.2nm. Strong fluorescence 
enhancements, up to 54 and 136 times respectively, are observed 
in those two systems. This strong enhancement indicates that the 
local field, which decays exponentially from the peak of a hotspot, 
dominates the fluorescence enhancement profile. 

The study of the surface enhancement effect mirrors the development 
of surface analysis techniques. Previous optical experiments revealed 
that the roughness of the surface has a critical role in determining the 
strength of the enhancement””’. Further experimental’? and theor- 
etical’’ studies on the impact of the surface roughness led to the con- 
nection between the surface enhancement effect and the surface 
plasmon, as well as to the hotspots being termed localized surface plas- 
mon polaritons*'*'*. More recently, near-field scanning optical micro- 
scopy'®, electron energy-loss spectroscopy®, cathodoluminescence 
imaging’, two-photon photoemission imaging* and experiments with 
a waveguide mode excitation’” have shown that, at these hotspots, the 
fluorescence enhancement is confined to a region far smaller than the 
wavelength of light, yet the field ofa single hotspot has not been resolved. 

We developed a single molecule super-resolution imaging method, 
based on the fact that individual fluorescent molecules can be localized 
with single-nanometre accuracy from the optical far field. When mul- 
tiple emitters reside within a diffraction-limited spot, one has to ensure 
that these emitters emit one at a time’**’. Control of the emitting 
sequence can be done by serial photo-switching of fluorescence mol- 
ecules'*””, using techniques such as photo-activated localization micro- 
scopy’® and stochastic optical reconstruction microscopy’. These 


techniques, together with structural illumination microscopy and sti- 
mulated emission depletion microscopy, allow optical resolution 
beyond the diffraction limit in biological samples (for a review, see 
ref. 22). As photo-switching would be infeasible for investigating local 
field, we use the Brownian motion of single dye molecules”*”’”’ in a 
solution to let the dyes scan the surface of single hotspots in a stochastic 
manner, one molecule at a time. 

Figure 1a illustrates the principle of this new technique. Here, a 
sample is submerged in a solution of fluorescent dye. The chamber 
containing the sample is mounted on a total internal reflection (TIRF) 
set-up. As the diffusion of the dye molecules is much faster than the 
image acquisition time (a 1-nm-diameter sphere diffuses through a 
200-nm-wide spot in less than 0.1 ms on average in water at room 
temperature, in contrast to an imaging time of typically 50-100 ms), 
the fluorescence from the rapidly diffusing dye molecules contributes 
only to a homogeneous background. When a dye molecule is adsorbed 
onto the surface of a hotspot, it appears as a bright spot. By using a 
maximum likelihood single molecule localization method” (Sup- 
plementary Information), the molecule can be localized with single- 
nanometre accuracy'**!, and the fluorescence enhancement can be 
deduced from the intensity of the fluorescence. After the dye molecule 
is bleached, which typically occurs within hundreds of milliseconds, 
the fluorescence disappears and the hotspot is ready for the next 
adsorption event. By choosing the right concentration of dye mol- 
ecules, the adsorption rate can be controlled to the point of ensuring 
that only one molecule emits at a time. As we use a camera to record 
single molecule adsorption events, multiple hotspots within a large 
field of view of up to 1 mm” can be imaged in parallel. This is much 
more efficient than raster scanning based techniques, such as near-field 
scanning optical microscopy and electron energy-loss spectroscopy. 

Using this method, we are able to image the fluorescence enhance- 
ment profile of single hotspots as small as 15 nm with an accuracy 
down to 1.2 nm, well beyond the diffraction limit, in just a few minutes 
(Supplementary Movie; Fig. 1b). The hotspots are formed on the sur- 
face of a thin (12-15 nm) aluminium film deposited on a quartz sub- 
strate by electron-beam evaporation. Water facilitates the oxidization 
process”; an oxidized layer can form on the surface of the aluminium 
film and reach a thickness of up to 8 nm (ref. 25). Each sphere in Fig. 1b 
represents a single molecule adsorption event, with the x and y coor- 
dinates representing the location of the molecule’s centroid, and the z 
coordinate corresponding to the intensity of the molecule’s fluor- 
escence. Single molecules can be localized with an accuracy down to 
1.2 nm (Fig. 1c; Supplementary Information). The measured enhance- 
ment here exhibits a rapid exponential decay with a decay constant of 
9.8 nm (Fig. 1d, e). The standard deviation of the location of observed 
single molecule events was used as a model-independent measure of 
the width (15.4 nm) of the hotspot, as shown in Fig. 1f. The existence of 
such a small hotspot demonstrates the presence of tight near-field 
optical confinement. 

The accurate estimation of the hotspot enhancement and electro- 
magnetic field experienced by the molecule is confounded by a number 
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Figure 1 | The principle of Brownian motion single molecule super- 
resolution imaging. a, Hotspots appear on the surface of a thin aluminium 
film under a total internal reflection (TIRF)-type illumination at 532 nm. To 
map the field distribution inside the hotspots, we use the Brownian motion of 
fluorescence dye molecules (Chromeo 546). The dye molecules stochastically 
adsorb to the surface. After a few frames (50-100 ms per frame), the dye 
molecules photobleach, give rise to a ‘blinking’ pattern (Supplementary Movie): 
each blink corresponds to one adsorption-bleaching event. By controlling the 
concentration of the dyes, the adsorption rate can be adjusted to ensure that 
within a diffraction limited spot, only one molecule emits photons at a time; 
therefore the position of the molecule can be determined by a maximum 
likelihood localization method” with accuracy down to 1.2 nm (c). b, By using 
the adsorption locations as the x and y coordinates, and the fluorescence 
intensity as the z coordinate, we obtain a 3D scatter plot of the fluorescence 


of factors, including the molecule’s orientation relative to the polariza- 
tion of local field, and stochastic photobleaching (Supplementary Infor- 
mation). To decrease the effect of those stochastic variables on the final 
shape of the profile, we use a Gaussian kernel method to render the 
image of a hotspot’®, as shown in Fig. 2a. Each pixel X of the rendered 
image, I(X), is a weighted average of the intensity from all of the single 
molecule events, with molecules closer to X carrying more weight 
(Fig. 2a). The window size of the kernel is determined by the accuracy 
of the single molecule localization. The rendered image reports an aver- 
aged profile of the local fluorescence enhancement. A hotspot may have 
complex structures that are smaller than the size of the kernel; these fine 
structures are removed in the stochastic rendering process (Supplemen- 
tary Information). The exponential decay is evident, with the full-width 
at half-maximum (FWHM) measured as ~20 nm (Fig. 2b). 

By comparing the fluorescence from the dye molecules adsorbed on 
the hotspot to that from the dye molecules immobilized on the surface 
of a quartz slide under the same conditions (Supplementary Informa- 
tion), we determined that the fluorescence enhancement is about 36 
times greater at the centre of the 20-nm hotspot. This modest enhance- 
ment results from the high ohmic loss of aluminium at the visible 
wavelength involved. Among the total of 60 hotspots that were ana- 
lysed, we found a broad distribution of enhancement factors and sizes, 
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enhancement profile of the hotspot, with each sphere representing one single 
molecule event. c, The accuracy of the reconstructed field profile, estimated 
from the variance of the maximum likelihood localization” (Supplementary 
Information), depends on the number of photons collected from the molecules, 
with brighter single molecule events showing better accuracy, down to 1.2 nm. 
The molecules within —2 nm < y <2 nm are shown in d, which represents a 
cross-section of the hotspot at y= 0 nm. A similar plot of the cross-section of 
the hotspot at x = 0 nm is shown in e. The envelope appears as an exponential 
decay with a constant of 9.83 nm (red dashed lines). To avoid crowding, only 
the single molecule localization variance of a few spheres near the envelope is 
shown. f, The distribution of the single molecule events provides a direct 
measure of the size of the hotspot; the width of the hotspot characterized by the 
standard deviation of the single molecule events is 15.2 nm and 15.4nm in x 
and y directions, respectively. 


with approximately 32 nm as the average size of the spots (Fig. 2c). The 
maximum fluorescence enhancement factor of the hotspots was found 
to depend inversely on their size; the largest enhancement factor we 
observed was 54 times (for a 15-nm hotspot); this results from the 
tighter confinement of the electromagnetic field in a smaller hotspot 
(Fig. 2d). 

As hotspots also appear in metal nanoparticle clusters’*"*, and the 
mechanism has been postulated to be similar to that of the metal thin 
films, we investigated the hotspots that formed in silver clusters con- 
sisting of silver nanoparticles with average diameter of 40 nm. In addi- 
tion to the oxidation layer of silver”®, the dispersing surfactant (that 
comes with the silver nanoparticle suspension) condenses on the sur- 
face of the silver nanoparticle clusters during the aggregation process. 
We found that the fluorescence enhancement profile of the hotspots 
formed in the silver clusters has an exponential shape, similar to that of 
hotspots on the surface of an aluminium film (Fig. 3a, b). The electro- 
magnetic field is strongly confined in a hotspot within an elliptical region 
of 13.2 nm X 20.3 nm, more than 30 times smaller than the wavelength 
of the excitation laser (Fig. 3c). 

The fluorescence enhancement includes contributions from both local 
field and quenching. Experiments have shown that the quenching— 
transfer of energy from molecules to the metal nanostructure”’—is 
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Figure 2 | Hotspots on an aluminium film. a, As the fluorescence of a single 
molecule is intrinsically stochastic, we remove this randomness by using a 
Gaussian kernel method to render the image of the field distribution. A hotspot 
on the surface of an aluminium film is shown. Each pixel X of the rendered 
image corresponds to average intensity from all of the single molecule events, 
with molecules closer to X carrying more weight. The kernel window size is 
2.1 nm; this small window size makes the image appear noisy. An exponential 
decay field profile is visible, and is more evident on a log scale, shown in b as 
almost a decade of straight line (red solid line). The blue and green curves are 
two cross-sections of the hotspot along x and y directions through the peak. The 
FWHM of the spot is ~20 nm. ¢, All of the hotspots observed are of deep sub- 
wavelength size, with an average width of 32.3 nm. By plotting the 
enhancement factor of the hotspots against their width (d), an inverse 
relationship between the size and the enhancement factor is visible: tighter 
confinement leads to stronger enhancement. 


short-ranged compared to the local field. Therefore there exist two 
distinct regimes above the surface of a metal: a quenching dominated 
regime near the surface, followed by a local field dominated regime 
further away from the surface**”; the transition point between the two 
regimes coincides with the peak of the fluorescence enhancement. This 
peak ranges from a few nanometres to tens of nanometres, depending 
on the materials and the geometry of the nanostructures*®**. The 
dielectric layers formed on the surface of the metal shift the adsorbed 
layer of molecules into the local field dominant regime, contributing to 
the strong (up to 54 times (Al) and 136 times (Ag)) fluorescence 
enhancement observed. To reproduce the effect of the dielectric layers, 
we deposited a ~9-nm-thick spacer layer of self-assembled protein 
molecules on the surface of silver nanoparticle clusters in situ, in addi- 
tion to the original dielectric layer (Supplementary Information). As 
the transition point of silver nanoparticle clusters is ~2.5 nm above the 
surface in a similar system”®, the observation will be in the local field 
dominant regime. Comparing the same hotspot without and with the 
spacer layer, we found an ~36% drop in the fluorescence enhancement, 
an ~26% increase in width, and an essentially unchanged exponential 
shape (Supplementary Information), which confirms that an exponen- 
tial profile is a general feature of the local electromagnetic field. 

The single molecule super-resolution approach is a generic tech- 
nique, which offers a unique, perturbation free capability for imaging 
the electromagnetic field enhancement of optical nanostructures 
with single-nanometre precision. Using this technique, we have 
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Figure 3 | A hotspot on silver nanoparticle clusters. a, A hotspot formed on 
silver nanoparticle clusters appears similar to those formed on the aluminium 
film. A 644-nm laser is used for excitation, and Chromeo 642 dye (Active 
Motif)—whose emission centres around 660 nm—is used. The maximum 
enhancement factor at the centre of the peak corresponds to 136 times the 
florescence from the same dye molecules adsorbed on a glass surface. b, The 
hotspot exhibits a similar exponential decay profile as those formed on the 
surface of the aluminium film. Two cross-sections of the hotspot along x 
(green) and y (blue) directions through the peak are plotted on a logarithmic 
scale, with the solid red lines as eye-guides for the exponential profile. c, The 
widths of the hotspot, estimated from the distribution of the single molecule 
events on a scatter plot, are 13.2 nm and 20.3 nm along the two axes. 


demonstrated the first (to our knowledge) direct measurement of a 
single hotspot as small as 15 nm and with an accuracy down to 1.2 nm. 
Such measurements will accelerate understanding of localized electro- 
magnetic modes. The exponential profile observed sheds new light on 
the much-debated mechanism of extraordinary field confinement in a 
two-dimensional disordered system where Anderson-localized modes*”* 
could emerge. Although signatures of the Anderson-localized mode 
have been reported in two-dimensional disordered metallic systems’, 
its hallmark, an exponential profile, had not been directly observed up 
until now. Additionally, other imaging modalities can be integrated to 
provide spectroscopy as well as fluorescence lifetime information on 
the molecules; this approach could be used to investigate the strong- 
coupling regime such as occurs with resonant nano-antennas*”*. In 
the strong-coupling regime, the distributed dipoles in the antenna 
could significantly affect the fluorescence of the molecules, and the 
strength of the coupling cannot be directly measured from the intensity 
of the fluorescence signals. A super-resolution measurement of the 
spatial distribution of the molecules’ fluorescence lifetime will help 
visualization and understanding of the coupling. Furthermore, with 
the development of brighter fluorophores, better photo-bleaching 
suppression methods, and more efficient three-dimensional super- 
resolution techniques, three-dimensional imaging will also be realized. 


METHODS SUMMARY 


Details of the experiment can be found in Methods. There we describe the fabrica- 
tion of the aluminium film on a quartz slide, the self-assembly of silver nanoparticle 
clusters on a glass cover slide, the prism-TIRF experiment for the observation of the 
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hotspots on the aluminium film, the objective-TIRF experiment for the observation 
of the hotspots in silver nanoparticle clusters, the single-molecule localization 
method used to determine the adsorption centre and the fluorescence intensity of 
each single-molecule events, and the Gaussian kernel method for image rendering. 
More discussion on the single-molecule localization and the imaging rendering 
methods can be found in the Supplementary Information. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Observations of hotspots on aluminium films. A quartz slide (3 inch X 1 inch, 
Ted Pella or SPI) is thoroughly cleaned according to a protocol developed at Radio 
Corporation of America (RCA protocol), before deposition. First, the slide is soaked 
in a bath of a base solution (distilled water, hydrogen peroxide and ammonium 
hydroxide at a ratio of 4:1:1) for 15 min; it is then transferred to an acid bath (distilled 
water, hydrogen peroxide and hydrogen chloride at a ratio of 6:1:1) for another 
15 min, followed by 15 min of sonication in filtered distilled water (Milipore). The 
slide is then blow-dried with pure nitrogen gas and placed immediately inside an 
electron-beam evaporation chamber (Torr International). After the vacuum is 
pumped down to a minimum of 10° torr, deposition is started at a deposition rate 
of 0.1nms_’. A crystal detector monitors the thickness of the film in real time. The 
deposition stops once the thickness reaches 15 nm. 

A flow chamber is made by sandwiching an aluminium coated quartz slide anda 
clean glass cover slip using double-sided tape. The glass cover slips (no. 0, Ted 
Pella) are cleaned following the same procedure used for cleaning the quartz slide 
and stored in a clean jar filled with filtered distilled water before use. The chamber 
is placed ona home-built prism type total internal reflection (TIRF) microscope. A 
200-mW 532-nm laser (Lambda Service) is used as the excitation light. The 
intensity is adjusted by neutral density filters to about 7 mW measured immedi- 
ately before the prism. A 1-nm laser line band pass filter (Edmunds Optics), 
centred at 532 nm, is used to clean the excitation light that then impinges on a 
quartz Pellin-Broca prism (Thorlabs) and excites an evanescent wave on the 
surface of the sample. A 60X, NA 1.25 objective lens (Zeiss) is used to collect 
the light, and an electron multiplying charge coupled device (EMCCD Cascade 
512, Photometric) is used to record fluorescent images. A 532-nm long pass filter 
(Semrock) and a band pass filter whose centre wavelength and bandwidth are 
582 nm and 80 nm, respectively (582/80m from Chroma Technology), are placed 
after the objective lens. A 4X magnifier (Nikon) is placed right in front of the 
EMCCD in order to make the pixel size 72 nm. The camera’s exposure time is 
80 ms. 

Filtered distilled water (Milipore) is first poured into the chamber. 10nM 
Chromeo 542 dye (Active Motif) in filtered distilled water (Milipore) is then 
flowed into the chamber. Other dyes, including Cy3 (GE Healthcare), and 
Alexa-555 (Life technology) yield similar results. Immediately after the dye is 
flowed in, the fluorescence background increases significantly. Blinking at some 
spots was immediately observed and recorded. To avoid errors caused by the slow 
drifting of the objective lens, the total recording time is limited to a few minutes. To 
compare the enhancement factor, a similar experiment is performed in a quartz 
flow chamber without aluminium film coating, with a dye concentration of 
~1nM. 

Observations of hotspots on silver nanoparticle clusters. Glass cover slips (no. 
1.5, Fisher Scientific) are cleaned following the same procedure used for cleaning 
the quartz slide. The cleaned cover glass is stored in a clean jar filled with filtered 
distilled water before use. A drop of 0.1 ml concentrated solution of 40 nm silver 
colloids (Ted Pella) is coated on a cleaned cover glass. As the water evaporates, a 
layer of nanoparticle clusters formed on the surface. A home-built objective type 
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TIRF with a Nikon TIRE objective (NA 1.49, 100X) is used. The excitation beam 
from a 40-mW 644-nm laser (Coherent, Cube) is reflected by a dichroic mirror 
with transition edge at 650 nm (650DCLP Chroma) into the objective lens. The 
power at the objective lens is measured to be ~4mW. An emission filter (710/ 
130m Chroma), whose centre and bandwidth are 710 nm and 130 nm respectively, 
is used before an EMCCD camera (Cascade 512, Princeton Instruments). To 
reduce the pixel size, a 4X magnifier (Nikon) is placed right in front of the 
EMCCD. The final pixel size corresponds to 48 nm per pixel. 10 nM Chromeo 
642 dye (Active Motif) solution is used for the single molecule super-resolution 
imaging. The exposure time of the camera is 20 ms. 

Single-molecule localization. This is performed by using a maximum likelihood 
estimation method”. Briefly, the log likelihood function 


ys, (—E; +n; log E;— log(n;!)) 


where n; is the observed number of photons at pixel i, and E; is the expected 
number of photons calculated from the fitting parameters, is minimized. The 
accuracy of the estimation, determined from the variance, is: 
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Since each pixel of a camera has a finite size of a, the width of the Gaussian PSF has 
to be corrected as o,” = 0° + a’/12. N is the total number of photons observed 
from the rawimage. The integration in the bracket accounts for the shot noise from 
the background photons as t = 2a,"b*/Na’, where b is the average background 
photon count™. The factor of 2 corrects the gain noise from an EMCCD camera. 
We performed the single molecule localization in two steps: a least squares 
fitting of the raw image to a 2D Gaussian function is first used to obtain the initial 
parameters, and then the parameters are fed into the maximum likelihood estima- 
tion method to extract the centroid position x and y, and the amplitude, which are 
then used to generate the 3D scatter plot shown in Fig. 1. 
Gaussian kernel image rendering. The collected single molecule events distribute 
in a random and sparse manner in space. As it is difficult to apply a fixed mesh 
directly for image rendering, we use a Gaussian Kernel rendering procedure. 
Each image spot is analysed by a maximum likelihood estimation method” to 
extract the centroid position x, y and the intensity I. We then assign each single 
molecule event, represented by a colour sphere in the 3D scatter plot, to a Gaussian 
function, with the peak position as x and y, and the width as o, determined from 
the accuracy of the single molecule localization by equation (1). The Gaussian 
function thus represents the probability of finding the true location of the single 
molecule event in space. 


I(x)= yok exp(—(x—x))’/20")/ > exp(—(x—x;)"/207) 


Multiplying each Gaussian function with the intensity J; from the fitting, summing 
them up, and normalizing the sum by the mean number of events, provides a 
smooth 3D image of the enhancement profile. 
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Grains and grain boundaries in single-layer graphene 


atomic patchwork quilts 
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The properties of polycrystalline materials are often dominated by 
the size of their grains and by the atomic structure of their grain 
boundaries. These effects should be especially pronounced in two- 
dimensional materials, where even a line defect can divide and 
disrupt a crystal. These issues take on practical significance in 
graphene, which is a hexagonal, two-dimensional crystal of carbon 
atoms. Single-atom-thick graphene sheets can now be produced by 
chemical vapour deposition’” on scales of up to metres*, making 
their polycrystallinity almost unavoidable. Theoretically, graphene 
grain boundaries are predicted to have distinct electronic®*, 
magnetic’, chemical’® and mechanical’ properties that strongly 
depend on their atomic arrangement. Yet because of the five-order- 
of-magnitude size difference between grains and the atoms at grain 
boundaries, few experiments have fully explored the graphene 
grain structure. Here we use a combination of old and new trans- 
mission electron microscopy techniques to bridge these length 
scales. Using atomic-resolution imaging, we determine the loca- 
tion and identity of every atom at a grain boundary and find that 
different grains stitch together predominantly through pentagon- 
heptagon pairs. Rather than individually imaging the several 
billion atoms in each grain, we use diffraction-filtered imaging™* 
to rapidly map the location, orientation and shape of several 
hundred grains and boundaries, where only a handful have been 
previously reported’*-”’. The resulting images reveal an unexpectedly 
small and intricate patchwork of grains connected by tilt boundaries. 
By correlating grain imaging with scanning probe and transport 
measurements, we show that these grain boundaries severely 
weaken the mechanical strength of graphene membranes but do 
not as drastically alter their electrical properties. These techniques 
open a new window for studies on the structure, properties and 
control of grains and grain boundaries in graphene and other 
two-dimensional materials. 

Figure 1a shows a large array of the suspended, single-layer graphene 
membranes used in this study. We grew predominately single-layer 
graphene films on copper foils by chemical vapour deposition’ 
(CVD) using three different growth recipes, which we refer to as growth 
methods A, B and C. Unless otherwise stated, all data were taken on 
graphene grown with method A, which was similar to the recipe 
reported in ref. 1. Methods B and C are slight variations: method B uses 
ultrapure copper foils'® (99.999% pure rather than 99.8%) and method 
C uses a rapid thermal processor furnace (Methods). These films were 
transferred onto holey silicon nitride or Quantifoil transmission elec- 
tron microscopy (TEM) grids using two different techniques (Methods 
and Supplementary Information). One key innovation over previous 
graphene TEM sample fabrication” was the gentle transfer of the gra- 
phene onto a TEM grid using a minimum of polymer support and 
baking the samples in air to remove the polymer without liquid solvents. 


This produces large arrays of free-standing graphene sheets covering up 
to 90% of the TEM grid holes. 

To characterize these membranes at the atomic scale, we used 
aberration-corrected annular dark-field scanning transmission elec- 
tron microscopy (ADF-STEM), where a 60-keV, angstrém-scale elec- 
tron beam is scanned over the sample while the medium- to high-angle 
scattered electrons are collected. Keeping the electron beam voltage 
below the ~100-keV graphene damage threshold was necessary to 
limit beam-induced damage. Properly calibrated, this technique 
images the location and atomic number” of each atom and, along 
with TEM, has been used to study the lattice and atomic defects of 
graphene and boron nitrene’*’!”’. Figure 1b shows an ADF-STEM 
image of the crystal lattice within a single graphene grain. Away from 
the grain boundaries, such regions are defect free. 

In Fig. 1c, two graphene grains meet with a relative misorientation of 
27°, forming a tilt boundary. Additional images of grain boundaries are 
shown in Supplementary Figs 2c and 3. As highlighted in Fig. 1d, the two 
crystals are stitched together by a series of pentagons, heptagons and 
distorted hexagons. The grain boundary is not straight, and the defects 
along the boundary are not periodic. Although the boundary dislocation 


Figure 1 | Atomic-resolution ADF-STEM images of graphene crystals. 

a, Scanning electron microscope image of graphene transferred onto a TEM 
grid with over 90% coverage using novel, high-yield methods. Scale bar, 5 um. 
b, ADF-STEM image showing the defect-free hexagonal lattice inside a 
graphene grain. c, Two grains (bottom left, top right) intersect with a 27° 
relative rotation. An aperiodic line of defects stitches the two grains together. 
d, The image from c with the pentagons (blue), heptagons (red) and distorted 
hexagons (green) of the grain boundary outlined. b-d were low-pass-filtered to 
remove noise; scale bars, 5 A. 
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resembles structures proposed theoretically''’’, its aperiodicity con- 


trasts with many of these models and will strongly affect the predicted 
properties of grain boundaries. By analysing atomic scattering intens- 
ities’, we confirm that the boundary is composed entirely of carbon. In 
addition, although high electron beam doses could induce isolated 
bond rotations (Supplementary Fig. 3), the boundary was largely stable 
under the 60-keV electron beam. Thus, the polycrystalline graphene is a 
strongly bonded, continuous carbon membrane. We also note that many 
grain boundaries are decorated by lines of surface particles and adsor- 
bates (Supplementary Fig. 4), suggesting that, as predicted”°, they may be 
more chemically reactive than the pristine graphene lattice. 

Both STEM and TEM, which determine the positions and identities of 
atomic nuclei, and complementary scanning tunnelling microscopy, 
used to probe valence wavefunctions'*”, are invaluable for understand- 
ing the local properties of grain boundaries. Using these atomic-resolu- 
tion approaches, however, tens of billions to hundreds of billions of 
pixels would be needed to image even a single micrometre-scale grain 
fully, with estimated acquisition times of a day or more. Other candi- 
dates for characterizing grains on larger scales, such as low-energy elec- 
tron microscopy'® and Raman microscopy’, typically cannot resolve 
small grains and may be difficult to interpret. Fortunately, electron 
microscopy offers an ideal technique for imaging grains on the necessary 
length scales: dark-field TEM (DF-TEM), which is a high-throughput, 
diffraction-sensitive imaging technique* that can be implemented on 
most TEMs built in the past sixty years. This method is usually applied 
to foils about 100-300-nm thick'*, but we demonstrate below that, 
remarkably, it also works on single-atom-thick sheets—even on samples 
too dirty for atomic-resolution imaging. In this manner, DF-TEM pro- 
vides a nanometre- to micrometre-scale grain analysis that comple- 
ments ADF-STEM to give a complete understanding of graphene 
grains on every relevant length scale. 

Figure 2a, b shows a bright-field TEM image of a graphene sheet 
along with the selected-area electron diffraction pattern created from 
this region of the membrane. Owing to graphene’s six-fold symmetry, 
electron diffraction from a single graphene crystal results in one set of 
six-fold-symmetric spots. Figure 2b contains many such families of 
spots, indicating that the field of view contains several grains of dif- 
ferent orientations. DF-TEM images these grains one by one with few- 
nanometre resolution using an objective aperture filter in the back 
focal plane to collect electrons diffracted through a small range of 
angles, as shown by the circle in Fig. 2b. The resulting real-space image 
(Fig. 2c) shows only the grains corresponding to these selected in- 
plane lattice orientations and requires only a few seconds to acquire. 
By repeating this process using several different aperture filters, then 
colouring and overlaying these dark-field images (Fig. 2d, e), we create 
complete maps of the graphene grain structure, colour-coded by lattice 
orientation, as shown in Fig. 2e-g. 
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Figure 2 | Large-scale grain imaging using DF- 
TEM. a-e, Grain imaging process. a, Samples 
appear uniform in bright-field TEM images. 

b, Diffraction pattern taken from a region in 

a reveals that this area is polycrystalline. Placing an 
aperture in the diffraction plane filters the scattered 
electrons forming c, a corresponding dark-field 
image showing the real-space shape of these grains. 
d, Using several different aperture locations and 
colour-coding them produces e, a false-colour, 
dark-field image overlay depicting the shapes and 
lattice orientations of several grains. f, g, Images of 
regions where many grains emanate from a few 
points. Scale bars, 500 nm. 


The images obtained are striking. The grains have complex shapes 
and many different crystal orientations. In Fig. 2e-g, we observe special 
locations from which many grains emanate. Small particles and multi- 
layer graphene also are often found near these sites; see, for example, 
Fig. 2e, top right. Both the average spacing (2-4 |1m) and shapes of these 
radiant sites when we use growth method A are comparable with 
Raman and scanning electron microscope observations of graphene 
nucleation'’, suggesting that these locations are probably nucleation 
sites. Similar structures have been observed in studies of crystallization 
in colloids and are consistent with crystallization around impurities”. 
Similar multigrain nucleation on copper has recently been observed 
using low-energy electron microscopy”*. Significantly, each apparent 
nucleation site gives rise to many grains of different orientations, result- 
ing in a mean grain size much smaller than the nucleation density. 

The distributions of grain size and relative angular orientation are 
readily determined from DF-TEM images. As discussed below, grain 
sizes are dependent on growth conditions, here ranging from hundreds 
of nanometres to tens of micrometres for slight changes in growth con- 
ditions. In Fig. 3a, we plot a histogram of grain sizes across several 
samples grown using method A. The mean grain size, defined as the 
square root of the grain area, is 250 + 11 nm (s.e.m.). This size is much 
smaller than the grain size of the copper substrate’* (100 {um-1 mm) and 
typical lateral grains measured in bulk, highly ordered pyrolytic graphite” 
(6-30 um). The inset in Fig. 3a shows the cumulative probability of 
finding multiple grains in a given area. This plot demonstrates that 
micrometre-scale CVD graphene devices produced from this set of films 
will nearly always contain multiple grains. Figure 3b shows a histogram of 
the relative crystallographic angles between adjacent grains. Because of 
graphene’s six-fold crystal symmetry, the diffractive imaging technique 
only determines grain rotations modulo 60°. Consequently, the mea- 
surable difference between grain orientations is from 0 to 30° (with, for 
example, 31° measured as 29°). We observe a surprising and robust 
preference for low-angle (~7°) grain boundaries and high-angle 
(~30°) boundaries similar to that seen in Fig. 1. 

Additional information about these orientations comes from the 
larger-area diffraction patterns in Fig. 3c, which we created by averaging 
diffraction data sampled across 1,200-11m? regions of graphene. The 
broadened diffraction peaks in Fig. 3c (left) show a distinct six-fold 
pattern, indicating that a significant fraction of the grains are approxi- 
mately aligned across large areas This alignment can also be seen in 
Fig. 3d, which is a low-magnification DF-TEM image showing grains 
with a small (~ 10°) range of in-plane lattice orientations. Almost half of 
the membrane appears bright, indicating that these grains are all 
approximately aligned. In contrast, a dark-field image of randomly 
oriented grains would only show roughly one-sixth (10°/60°) of the 
graphene membrane. In the diffraction pattern of a separately grown 
sample (Fig. 3c, right), we instead find a clear 12-fold periodicity, 
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Figure 3 | Statistical analysis of grain size and orientation. a, Histogram of 
grain sizes, taken from three representative samples using DF-TEM. The mean 
grain size is 250 + 11 nm (s.e.m., n = 535). Inset, plot of the cumulative 
probability of having more than one grain given the area of a device. 

b, Histogram of relative grain rotation angles measured from 238 grain 
boundaries. c, d, Large-area diffraction patterns (c) and a low-magnification 
DF-TEM image (d) show that grains are globally aligned near particular 
directions. Scale bar, 2 um. 


indicating that there are two main families of grains rotated from one 
another by 30°. These distributions, which often contain smaller sub- 
peaks (Supplementary Fig. 6), are consistent with the frequent obser- 
vation of low-angle and high-angle (~30°) grain boundaries. We 
attribute these alignments to registry to the copper substrate used for 
graphene growth. Such registry has recently been observed in low- 
energy electron microscopy and scanning tunnelling microscopy studies 
of graphene growth on copper (100) and (111) surfaces’*'®"*. 

By directly correlating grain structure with growth methods, these 
DF-TEM methods can be used to build on recent studies* that have 
demonstrated links between island nucleation density and growth 
conditions. Fig. 4a—c shows three composite DF-TEM images of 
graphene grown using methods A, B and C. The slight differences 
between growth methods effected significant changes in the grain size, 
shape and crystallographic orientation of the CVD graphene. For 
example, with growth method C we observed grains averaging 
1-4 um (Fig. 4c), which is an order of magnitude larger than the grains 
grown using method A. Our DF-TEM methods provide a powerful 
characterization tool for understanding and controlling grain growth, 
which will bea rich field of study important for graphene applications. 

The ability to image the grain structure in graphene monolayers easily 
opens the door to the systematic exploration of the effects of grain 
structure on the physical, chemical, optical and electronic properties of 


Growth method A b Growth method B 


Figure 4 | Grain structure and mobilities for three growth conditions. 

a-c, Composite DF-TEM images of grain structure show variations with growth 
condition. The mean grain sizes are 250 + 11 nm (s.e.m.; growth method A, 
99.8% pure copper), 470 + 36 nm (s.e.m.; growth method B, 99.999% pure 
(ultrapure) copper) and 1.7 + 0.15 um (s.e.m.; growth method C (rapid thermal 
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graphene membranes. We find that such studies are further facilitated 
because grain boundaries are visible in scanning electron microscopy 
and atomic force microscopy (AFM) phase imaging owing to preferen- 
tial decoration of the grain boundaries with surface contamination 
(Fig. 5a and Supplementary Figs 9 and 10). Below, we show two examples 
probing the electrical and mechanical properties of grain boundaries. 

We first examine the failure strength of the polycrystalline CVD gra- 
phene membranes (growth method A) using AFM. We used AFM phase 
imaging to image grains (Fig. 5a) and then pressed downwards with the 
AFM tip to test the mechanical strength of the membranes. As seen in 
Fig. 5b, the graphene tears along the grain boundaries. From repeated 
measurements, we find that failure occurs at loads of ~100 nN, which is 
an order of magnitude lower than typical fracture loads of 1.7 UN 
reported for single-crystal exfoliated graphene’. Thus, the strength of 
polycrystalline graphene is dominated by its grain boundaries. 

We probed the electrical properties of polycrystalline graphene by 
fabricating electrically contacted devices using graphene from all three 
growth methods. Figure 4d shows a histogram of mobilities extracted 
from four-point transport measurements. Devices grown using 
methods A, B and C have room-temperature mobilities of 1,000 + 750, 
7,300 + 1,100 and 5,300 + 2,300cm?V s 1(s.d.), respectively. The 
mobilities for growth method A are comparable to previous results on 
CVD graphene’, whereas the mobilities of growth methods B and C 
are closer to those reported for exfoliated graphene’ (1,000- 
20,000 cm? V! s '). By comparing these measurements with the cor- 
responding DF-TEM images in Fig. 4a—c, we are surprised to find that, 
although mobility is clearly affected by growth conditions, high mobility 
does not directly correlate with large grain size. 

To complement these bulk electrical measurements, we used scan- 
ning probe a.c. electrostatic force microscopy (AC-EFM) to test the 
resistivity of individual grain boundaries. We fabricated suspended gra- 
phene membrane devices”. One of these is shown schematically in 
Fig. 5c, where we also plot the relative potential along a graphene mem- 
brane between two biased electrodes, measured using AC-EFM. In this 
plot, high-resistance grain boundaries would manifest as sharp drops in 
potential. The graphene in these devices (growth method A) hada mean 
grain size of 250 nm, so a line scan across these 3-yym-long membranes 
should cross an average of 12 grains. However, no noticeable potential 
drops were detected, indicating that most grain boundaries in these 
devices are not strongly resistive interfaces. By assuming that the grain 
boundary runs perpendicular to the line scan, we estimate an upper 
bound on the grain boundary resistance of Rg < 60 Q jum/L, where L is 
the length of the grain boundary, to be compared with the sheet resist- 
ance Of Rgraphene = 700 Q/C1 for the entire device. In other words, the 
resistance of the grain boundaries is less than one-third the resistance of 
a 250-nm grain. Further measurements on six additional graphene 
membranes, both suspended and unsuspended, and from different 
growth methods, produced similar results. This small impact of grain 
boundaries stands in stark contrast to other materials, such as complex 
oxides, where a single grain boundary can lead to a million-fold increase 
in resistance over single crystals”. 


Growth method C d io 


Hg VethodA 
Hl Method B 
[iG Method C 


5,000 
ue (cm? V-1 $1) 


10,000 


anneal)). The graphene is visible through the 20-nm, perforated amorphous- 
carbon Quantifoil support film. The graphene is broken over three of the 
perforations in a. Scale bars, 2 um. d, Vertically stacked histogram of room- 
temperature mobilities, 1, measured from 39 devices using graphene growth 
methods A, B, and C. N, number of devices. See Methods for further details. 
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Figure 5 | AFM indentation and AC-EFM studies of graphene grain 
boundaries. a, b, AFM phase images of a graphene grain before and after an 
indentation measurement. a, Indentation takes place at the centre of this grain 
as shown by the arrow. b, The region is torn along grain boundaries after 
indentation. Scale bars, 200 nm. ¢, Electrostatic potential, averaged over three 
adjacent line scans along a suspended graphene sheet between two electrodes 
(schematic at top) and measured using AC-EFM. Although on average each 
line scan should cross 12 grains, no measureable features are present. Dashed 
lines indicate the locations of the electrodes. 


The imaging techniques reported here provide the tools to char- 
acterize graphene grains and grain boundaries on all relevant length 
scales. These methods will be crucial both for exploring synthesis 
strategies to optimize grain properties and for studies, such as those 
described above, on the microscopic and macroscopic impact of grain 
structure on graphene membranes. Thus, these results represent a 
significant step forward in realizing the ultimate promise of atomic 
membranes in electronic, mechanical and energy-harvesting devices. 


METHODS SUMMARY 

TEM/STEM. We did ADF-STEM imaging using a NION UltraSTEM100 with 
imaging conditions similar to those used in ref. 21. At 60 kV, using a 33-35-mrad 
convergence angle, the electron probe was close to 1.3 A in size and did not damage the 
graphene. Images presented in Figs 1-4 were acquired with the medium-angle annular 
dark-field detector with acquisition times of between 16 and 32 [1s per pixel. For TEM 
imaging, we used a FEI Technai T12 operated at 80 kV. Acquisition times for dark- 
field images were 5-10 s per frame. The spatial resolution in dark-field images ranges 
from 1 to 10 nm and is set by the size of the objective filtering aperture, in a trade-off 
between real-space resolution and angular resolution in reciprocal space. 

Scanning probe measurements. For AFM deflection measurements, we used a 
MFP3D scope from Asylum Research. We used silicon AFM probes (Multi75Al, 
Budget Sensors) with a resonant frequency of ~75 kHz, a force constant of ~3 Nm! 
and a tip radius of <10 nm. All imaging was done in tapping mode. For AC-EFM 
measurements, a DI 4100 AFM with a signal access module was operated in lift mode 
with a constant probe tip voltage, Vi, = 2 V, a lift height of 10 nm and no piezo drive 
on the tip. An a.c. voltage of Vo = 1 V was applied through the electrodes at the 
resonance frequency of the EFM cantilever, fan, ~ 77 kHz. An electrostatic force 
drives the EFM cantilever to resonate, and the amplitude of motion is measured. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

ADF-STEM. ADF-STEM imaging was conducted using a NION UltraSTEM100 
operated at 60 kV. Imaging conditions were similar to those used in ref. 21. Using a 
33-35-mrad convergence angle, our probe size was close to 1.3 A. Because the low- 
voltage electron beam was below the damage threshold energy’’, the pristine 
graphene lattice remains stable and defect free. High electron beam doses could 
induce isolated bond rotations at grain boundaries (Supplementary Fig. 4) similar 
to those seen in ref. 32. Images presented in Figs 1-4 were acquired with the 
medium-angle annular dark-field detector with acquisition times of between 16 
and 32 ks per pixel. 

DF-TEM. TEM imaging was conducted using a FEI Technai T12 operated at 
80 kV, which did not cause any apparent damage to the graphene membranes. 
Acquisition time for dark-field images were 5-10 per frame. The spatial resolu- 
tion in dark-field images ranges from 1 to 10nm and is set by the size of the 
objective filtering aperture in a trade-off between real-space resolution and angular 
resolution in reciprocal space. 

AC-EFM. A DI 4100 AFM with a signal access module was operated in lift mode 
with tip voltage Vij) = 2 V, a lift height of 10 nm and no piezo drive on the tip. Ana.c. 
voltage Vy = 1 V was applied through the electrodes at the resonance frequency of 
the EFM cantilever, feant ~ 77 kHz. An electrostatic force drives the EFM cantilever 
to resonate, and the amplitude of motion is measured. 

AEM imaging and deflection measurements. For AFM deflection measure- 
ments, we used a MFP3D scope from Asylum Research. We used silicon AFM 
probes (Multi75Al, Budget Sensors) with a resonant frequency of ~75 kHz, a force 
constant of ~3 Nm‘ anda tip radius of <10 nm. Allimaging was done in tapping 
mode. Images were taken with resolutions of 512 X 512 or 1,024 X 1,024, with 
acquisition times of at most 10 min. 

Graphene growth. We grew single-layer graphene using CVD on copper foils in 
three ways. Growth method A: similar to methods described in ref. 1, we annealed 
a 99.8% pure copper foil (Alfa Aesar #13382) at 1,000 °C at low pressure with an Hy 
flow of 7 standard cubic centimetres per minute (s.c.c.m.) for 10 min. We then 
grew the graphene at 1,000 °C by flowing CH,:H, at 150:7 s.c.c.m. for 10-15 min 
(varying growth time within this range did not yield noticeably different results). 
Samples are cooled for ~50 min while the CH4:H) flow is maintained. Growth 
method B: this is identical to method A, except we used higher purity (99.999%) 
copper foil (Alfa Aesar #10950). Growth method C: we used a rapid thermal 
processor tube furnace with a ~4'’ inner diameter (MTI Corporation). We 
annealed copper foil (99.8% purity) at 1,000°C (H2, 300s.c.c.m.) for 30 min, 
and then grew the graphene at 1,000 °C (CH4:H>, 875:300 s.c.c.m.) for 60 min. 
Samples for DF-TEM. We transferred the graphene either to commercial holey 
SiN TEM grids (such as PELCO Holey Silicon Nitride Support Films) with 2.5-,1m- 
diameter holes or to Quantifoil holey carbon TEM grids to allow imaging of larger 
grains. Quantifoil grids are typically 10-20 nm thick, which is thin enough to allow 
DF-TEM imaging through the carbon support. 

The fabrication for DF-TEM samples is a gentle graphene transfer method using a 
thin PMMA support, which produced roughly 90% coverage of TEM grid holes 
(that is, 90% of grid holes were uniformly covered with suspended graphene). After 
graphene growth ona copper foil, a thin layer of PMMA was spun onto the graphene 
(2% in anisole, 4,000 r.p.m. for 30 s), without a post-baking step. Copper was then 
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etched away by floating the foil, PMMA side up, in a HCl/FeCl; copper etchant 
(Transene, Type 100/200). Next, the graphene and polymer support were washed by 
transferring them to deionized-water baths, taking care to not bring the PMMA into 
contact with liquids, to avoid depositing unwanted residues on the PMMA side of 
this layer. Finally, the PMMA-graphene layer is scooped out in pieces onto TEM 
grids. PMMA can be thermally decomposed”’, which is a gentler process than using 
liquid solvent rinses. We baked our samples in air (350 °C for 3-4 h), without the use 
of an argon flow, which can slow the cleaning effect substantially. This step removes 
the PMMA layer, leaving the graphene freely suspended in a liquid-free release 
process. These high-yield samples were used in DF-TEM because they provided 
enough clean graphene to image large numbers of grains. 

Samples for ADF-STEM. Our secondary technique produced cleaner, but lower- 
yield, graphene using a polymer-free transfer method. This technique is similar to 
the methods of ref. 20, in which TEM grids are placed on top of the foil before 
etching and attached by dropping methanol on the grids. Our main addition to this 
technique was to bake the final samples in a series of annealing processes increas- 
ing in temperature. The grids were then baked in air at 350°C for 2h. In this 
method, the samples are annealed in ultrahigh vacuum by ramping the temper- 
ature to 950 °C, holding this temperature steady for 15 min and then cooling to 
room temperature without active cooling. This annealing is done below the gra- 
phene growth temperature, and the micrometre-scale grain structure did not 
change afterwards. Thus, any change that may result from annealing should be 
small in comparison with changes occurring during the formation of the grain 
boundaries. A final step was to anneal the grids at 130 °C for >8 h before trans- 
ferring them in air to the TEM. Because this transfer method uses no support film 
for the graphene as it is transferred, this method was a comparatively low-yield 
transfer process with coverage of just a few per cent over the holes. The advantage 
to this technique over the polymer-based transfer is that it produced graphene with 
less surface carbon contamination—regions hundreds of nanometres wide 
appeared atomically clean in ADF-STEM images. 

Electrically contacted samples. We fabricated top-gated graphene devices in 
four-point probe geometry (shown in Supplementary Fig. 11a, b, with electrodes 
labelled). A transferred graphene film was patterned by photolithography and a 
10-s exposure to an oxygen plasma to define the graphene strips. This was followed 
by fabricating 1.5-nm Ti/4.5-nm Au electrodes. We patterned a top gate, to mea- 
sure the charge mobility in graphene, by electron beam evaporation first of 90 nm 
of silicon oxide as a dielectric layer and then of a Cr/Au layer (1.5nm/50 nm), 
without breaking vacuum between each evaporation. 

For the EFM measurements, we fabricated electrically contacted, suspended 
graphene by growing single-layer graphene on copper using CVD; patterning 
the graphene into 3-j1m-wide strips while still on the copper foil, using contact 
lithography; and transferring the patterned strips onto a substrate with pre-patterned 
gold electrodes and trenches. 
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Primitive agriculture in a social amoeba 


Debra A. Brock’, Tracy E. Douglas’, David C. Queller’ & Joan E. Strassmann! 


Agriculture has been a large part of the ecological success of 
humans’. A handful of animals, notably the fungus-growing ants, 
termites and ambrosia beetles**, have advanced agriculture that 
involves dispersal and seeding of food propagules, cultivation of 
the crop and sustainable harvesting’. More primitive examples, 
which could be called husbandry because they involve fewer adapta- 
tions, include marine snails farming intertidal fungi® and damsel- 
fish farming algae’. Recent work has shown that microorganisms 
are surprisingly like animals in having sophisticated behaviours 
such as cooperation, communication*” and recognition’®”, as well 
as many kinds of symbiosis'”'’. Here we show that the social 
amoeba Dictyostelium discoideum has a primitive farming sym- 
biosis that includes dispersal and prudent harvesting of the crop. 
About one-third of wild-collected clones engage in husbandry of 
bacteria. Instead of consuming all bacteria in their patch, they stop 
feeding early and incorporate bacteria into their fruiting bodies. 
They then carry bacteria during spore dispersal and can seed a 
new food crop, which is a major advantage if edible bacteria are 
lacking at the new site. However, if they arrive at sites already con- 
taining appropriate bacteria, the costs of early feeding cessation are 
not compensated for, which may account for the dichotomous 
nature of this farming symbiosis. The striking convergent evolution 
between bacterial husbandry in social amoebas and fungus farming 
in social insects makes sense because multigenerational benefits of 
farming go to already established kin groups. 

The social amoeba Dictyostelium discoideum is well known for its 
social interactions. When prey bacteria become scarce, amoebae ag- 
gregate by the tens of thousands and produce a multicellular migratory 
slug that becomes a fruiting body in which about 20% of cells die to 
form a sterile stalk. The stalk aids the dispersal of the remaining cells, 
which differentiate into spores in a spherical structure called the 
sorus'*’” (Fig. 1a). We show that about one-third of wild-collected 
clones husband bacteria through the sporulation and dispersal process. 
We call these clones farmers because they carry, seed and prudently 
harvest their food, but the farming is primitive because no active 
cultivation is known. 

At first glance, beneficial interactions occurring between D. discoi- 
deum and the bacteria would be unexpected. In the unicellular state, D. 
discoideum are solitary predators of bacteria’*, and bacteria use a wide 
range of strategies to deter predation”’. Early life-history work on 
another species, Dictyostelium mucoroides, suggested the possibility 
of symbiosis with bacteria”! but did not document this interaction 
experimentally. The possibility fell out of favour, particularly after 
work’* finding no support for a symbiotic relationship and much 
support for a predatory one. However, that study considered only a 
single D. discoideum clone, and the extent of natural variation remains 
largely unexplored. In our study, we used a population of 35 wild 
D. discoideum clones isolated from soil collected at Mountain Lake 
Biological Station, Virginia, and Lake Itasca Biological Station, 
Minnesota (Supplementary Table 1). We observed that the sorus con- 
tents of some of the clones contained bacteria in addition to D. dis- 
coideum spores (Fig. 1b). To confirm this observation, we initially 
chose four clones that seemed to have bacteria in their sori and four 
clones that did not. We picked up the sorus contents of six random 


fruiting bodies from each and then spotted these individually on nutri- 
ent agar plates to test for bacterial growth. This assay confirmed that 
sori from some clones consistently contained bacteria that could ini- 
tiate new populations (Fig. 1c, top panel) and that sori from other 
clones did not (Fig. 1c, bottom panel). Bacteria also grew when dir- 
ectional light induced the multicellular slugs to migrate away from the 
original locations to bacteria-free zones of a plate before fruiting, indi- 
cating that bacteria are carried in the slugs. Four sets of wild isolates 
tested in this way yielded 36% farmers: 13 of 35 tested overall; 4 of 9 
from Minnesota; 5 of 14, 3 of 9 and 1 of 3 from three sample dates from 
Virginia. These data suggest that farmer clones are common, are found 
in the same locations as non-farmer clones and are therefore likely to 
have access to similar bacteria. 

To confirm that farmers and non-farmers belong to the same species, 
we constructed a Bayesian phylogenetic tree created from combined 
mitochondrial, ribosomal and variable nuclear-DNA sequence data 
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Figure 1 | Fruiting body structure and sorus contents from farmer and non- 
farmer D.discoideum clones. a, The fruiting body is composed of two main 
parts: the sorus, which contains the fertile spores, and the stalk, which holds the 
sorus aloft to facilitate spore dispersal. b, Bacteria and D. discoideum spores are 
present in the contents of a farmer sorus (left) but only spores are present in the 
contents of a non-farmer sorus (right). Scale bar, 5 jum. c, Sorus contents for six 
random, individual fruiting bodies from each of four farmers and four non- 
farmers were spotted on individual nutrient agar plates. All farmer sorus 
contents showed bacterial growth, whereas no bacterial growth was observed in 
sori from non-farmers. Plates were photographed after two days of growth. 
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from 14 clones (five farmers and nine non-farmers; Supplementary 
Fig. 1). Farmers were interspersed in the phylogeny with non-farmers, 
as expected if the trait is shuffled through the species by sex”. We also 
calculated pairwise genetic distances between farmer-farmer pairs, 
non-farmer-non-farmer pairs and farmer-non-farmer pairs and found 
no differences in the distributions (Supplementary Fig. 2). 

We sequenced a portion of the bacterial 16S ribosomal gene to 
identify the species of carried bacteria. Farmers carry a variety of 
species of bacteria, with diversity both within and between farmer 
clones, and diversity is likely to be underestimated because not all 
bacteria are culturable by our methods (Supplementary Table 2). 
About half of these bacterial strains serve as good food sources for 
D. discoideum, generally for farmers and non-farmers alike (Sup- 
plementary Table 2). The function of the other half, if any, is unknown, 
but all farmer strains transport and use the food bacteria supplied in 
the lab (either Klebsiella aerogenes or Escherichia coli), and we focus the 
remainder of the paper on food carrying. 

Carrying bacteria is a consistent clone-specific trait, and these clones 
show a number of differences from non-farmer clones that affect life 
history and fitness. To establish the consistency of the trait, we eliminated 
all living, carried bacteria from four farmers and four non-farmer con- 
trols by treating them with antibiotics. We then grew them on dead 
K. aerogenes as a food source, and confirmed using spotting tests as in 
Fig. 1 that they became bacteria free. When these bacteria-free clones 
were then grown on live K. aerogenes, all farmers regained an association 
and had these bacteria in their sori whereas non-farmers did not (Fig. 2). 

To examine costs and benefits in the farmer-bacteria interaction, we 
compared farmers with non-farmers under several conditions. Soil is a 
very structured, heterogeneous environment in the wild, where bacteria 
could occur in patchy, monospecific colonies of variable cell num- 
bers****, and the patchiness is accentuated because some dictyostelids 
have distinct food preferences”. We therefore mimicked spore dispersal 
to both bacteria-poor and bacteria-rich sites. For the former, we trans- 
ferred D. discoideum spores to plates with nutrients for bacteria, but 
without added bacteria (Fig. 3a). We found that bacteria transported 
by the farmer spores proliferate and are consumed by the farmers that 
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Figure 2 | Farmers readily reassociate with bacteria, suggesting a persistent 
interaction. After antibiotic treatment to eliminate carried bacteria, 
reassociation with bacteria was tested by growing four farmer and four non- 
farmer clones on K. aerogenes. We spotted ten random sori from each clone 
individually on nutrient agar plates, and counted positive/negative growth of 
bacteria in each sorus after two days. Farmers significantly differ from non- 
farmers in their ability to associate with bacteria (F),6 = 48.864, P < 0.001; error 
bars, s.e.m.). Pictured below the graph are representative examples of sorus 
contents for farmer and non-farmer clones after one round of growth with live 
bacteria. Farmer clone sori contain bacteria; no bacterial growth was detected 
for non-farmers, even after ten days. 
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also proliferate and then sporulate after the social stage, whereas non- 
farmers with no bacterial partners produce hardly any spores 
(F\6 = 58.97 (derived from F-test; subscripts, degrees of freedom), 
P<0.0001). Farmers are thus able to capitalize on available nutrients 
by carrying their own food bacteria in their sori. This difference disap- 
pears, and farmers do as poorly as non-farmers, if they are previously 
made bacteria free using antibiotics (F,6 = 0.49, P = 0.8082; data not 
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Figure 3 | The advantage of carrying food is context dependent. a, Farmers 
produce more spores than non-farmers when colonizing plates with bacterial 
food substrates but without bacteria (F,,6 = 58.97, P< 0.0001). We used three 
replicates of each of four farmer and four non-farmer clones. b, Farmers 
produce fewer spores than non-farmers when provided a fixed amount of live 
bacteria (F),12 = 64.36, P< 0.0001). We used three replicates from each of five 
farmer and nine non-farmer clones (the results are similar when only the eight 
clones in a are used). c, Farmers produce more spores than non-farmers in 
unsterilized soil (soil 1: F149 = 14.82, P = 0.0032; soil 2: Fy 49 = 26.21, 
P=0.0005). We used individual sori collected from six farmers and six non- 
farmers to test spore production after one round of the social stage. All error 
bars, s.e.m. d, Representative examples of farmer and non-farmer fruiting 
bodies formed after the social stage in unsterilized soil. Scale bar, 1 mm. 
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shown). Next we tested whether farming was costly when spores are 
transferred instead to a site where edible bacteria are abundant, reducing 
or eliminating the advantage to farmers of bringing their own bacteria. 
Farmer clones then produce fewer spores than non-farmers froma given 
number of live bacteria (F112 = 64.36, P< 0.0001; Fig. 3b). Therefore, 
carrying seed stocks can be advantageous or disadvantageous, depend- 
ing on bacterial availability at new sites. 

Sites entirely lacking bacteria in nature are rare, but farmers could 
still gain by bringing preferred bacteria (whatever allowed them to 
flourish and sporulate in their previous site), just as humans seed 
preferred plants in an already green world. We therefore tested spore 
production for farmers and non-farmers in unsterilized soil collected 
from two separate locations (soil 1 and soil 2) at the Houston 
Arboretum and Nature Center, Texas. We determined the numbers 
of colony-forming units of culturable bacteria in soil 1 and soil 2 to be 
(1.3-2.3) X 10° and (0.6-0.64) x 10° per gram of soil, respectively. 
However, the bacteria already present in the soil do not make bacteria 
carrying superfluous. Under these conditions, farmers produced more 
spores than non-farmers for both soil locations (soil 1: Fy,49 = 14.82, 
P= 0.0032; soil 2: F),19 = 26.21, P = 0.0005; Fig. 3c, d), as well as many 
more fruiting bodies (soil 1: F,19 = 35.78, P=0.0001; soil 2: 
F,19 = 9.31, P = 0.0122; data not shown). Farmer sori continued to 
carry their original bacteria (43 of 44 isolates checked by sequencing) 
whereas no bacteria were isolated from non-farmer sori. This suggests 
the bacteria available in the two test soils were not very suitable for both 
non-farmers and farmers, so the bacteria carried by the farmers 
allowed them to flourish in comparison with the non-farmers. 

We proposed that the lower success of farmers when bacteria are 
provided stems from prudent harvesting. D. discoideum amoebae 
normally leave the solitary stage and enter the social stage when food 
is exhausted’*'’, but farmers may do so sooner to save some bacteria for 
transport. We therefore measured the number of uneaten bacteria 
present along a developmental time course (Fig. 4a). During the solitary 
amoeba stage (day 1), there was no difference in bacterial density 
among treatments (also, day-1 farmers and non-farmers did not differ 
significantly in number of amoebas: F;,;7 = 0.6733, P = 0.4233; data 
not shown). During the social stage, however, bacterial usage differs 
significantly between farmers and non-farmers at all time points. Non- 
farmers eat all the bacteria whereas farmers leave many bacteria uncon- 
sumed, roughly half the number present as compared with bacteria 
grown alone. Figure 4b shows representative farmer and non-farmer 
clones photographed at day 5, revealing that only the farmers entered 
the social stage and formed fruiting bodies in the presence of uneaten 
bacteria. Thus, it seems that farmer clones forgo considerable food to 
save some for co-dispersal. 

Farmer clones also migrate significant less far than non-farmers 
during the mobile slug stage that immediately precedes fruiting 
(F,,14 = 87.59, P< 0.001; Fig. 4c). This might be a cost caused by bac- 
terial interference, or it might be an evolved response of not needing to 
moveas far when farmers carry their own bacteria. Either way, it adds to 
the list of significant differences between farmers and non-farmers. 

An alternative explanation for the apparent costs—leaving some 
bacteria unharvested and reduced slug migration—is that the bacteria 
are pathogenic and harm D. discoideum. However, the pathogen hypo- 
thesis does not account for why the farmer strains would be infectible 
by many bacteria and why infection is highly consistent, even after 
curing. Neither does it explain why all farmers carry food. Moreover, it 
does not explain why infection causes no cost to growth in the solitary 
stage (F, 1; = 0.72, P = 0.4132; Supplementary Fig. 3). Instead, the 
costs appear precisely where the farming hypothesis predicts (saving 
some food for transport) or where it provides a plausible explanation 
(less need for slug migration). However, a mixed explanation seems 
possible. Carrying food bacteria could have the side effect of sometimes 
taking up useless or harmful bacteria. The existence of the farmer 
polymorphism will allow additional within-species comparisons that 
should help in exploring the mechanisms, which are as yet unknown, 
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Figure 4 | Life-history traits differ between farmers and non-farmers. 

a, Bacterial usage. For 11 non-farmer clones and nine farmer clones, we 
individually spotted a fixed number of spores mixed with the food bacterium K. 
aerogenes on nutrient agar plates. We also spotted K. aerogenes alone as a 
control. Bacterial density for each spot was determined at various time points 
during development. During the social stage (days 3, 5, 7), farmers leave more 
bacteria unconsumed than non-farmers at all time points (type: F215 = 106.18, 
P<0.0001; day: F233 = 10.23, P = 0.0003; type X day: F133 = 10.60, 
P<0.0001; significant differences found within each day are indicated by 
different letters, which reflect results of a post hoc Tukey HSD test). At the 
solitary-amoeba stage (day 1), no difference in bacterial density was found 
among the three treatments (F,;9 = 1.2943, P = 0.2972). b, Bacterial usage of 
representative clones. Farmer clones develop fruiting bodies before all bacteria 
are exhausted whereas non-farmers consume all accessible food sources before 
fruiting. Examples of fruiting bodies are marked with arrows and appear as 
white dots. Clones were photographed on day 5. Scale bar, 3 mm. ¢, Migration. 
The average distance farmer slugs migrate towards light is less than for non- 
farmer slugs (F114 = 87.59, P< 0.001). Eight clones of each type were used, 
with two replicates. All error bars, s.e.m. 


but could be very simple alterations in timing of aggregation, sensitivity 
to bacteria or ability to produce a specific enzyme or toxin. The abund- 
ant scientific resources available for this model organism have recently 
proven very useful in understanding the genetics of their social inter- 
actions”. They should prove similarly useful here, providing a 
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unique model system for probing the genetics of eukaryotic—bacterial 
symbioses. 

The connection between farming and sociality may not be coincidental, 
because social species have suitably structured populations. In this social 
microbe, the advantage of prudent harvesting and seeding is large 
because it can benefit many generations of cell descendants before fruit- 
ing. Moreover, the high relatedness of natural fruiting bodies” mini- 
mizes any potential exploitation by non-farmers, which could either 
consume the bacteria that the farmers would save to carry, or freeload 
when co-dispersed with farmer spores and their bacteria. This same 
advantage—long-lived groups of kin—provides similarly fertile ground 
for agriculture in the ants and termites that are the most sophisticated 
non-human farmers. 


METHODS SUMMARY 


Isolation of wild D. discoideum strains. Isolation techniques followed published 
methods* with the exception that we collected soil samples of 20g or more, 
instead of 0.2 g, at each location. 
Culture conditions. We grew all wild isolates from spores on SM/5 agar plates (2 g 
glucose, 2g BactoPeptone (Oxoid), 2 g yeast extract (Oxoid), 0.2g¢ MgCh, 1.9¢ 
KHPO,, 1 g K,HPO,and 15 g agar per litre) in association with bacteria K. aerogenes 
or E. coli at room temperature (21 °C). 
PCR amplification and sequence identification of novel bacterial isolates. We 
followed the procedures outlined in “Identifying unknown bacteria using bio- 
chemical and molecular methods” (http://www.nslc.wustl.edu/elgin/genomics/ 
Bio3055/IdUnknBacteria06.pdf) with one exception. Bacteria to be cloned and 
identified were grown on and collected from SM/5 agar plates. PCR fragments 
generated (using the above procedure) were sequenced at Lone Star Labs 
(Houston, Texas). We used the NCBI website http://www.ncbi.nlm.nih.gov/ 
sutils/genom_table.cgi as the search tool for sequences to identify bacteria to 
species. 

Data analyses. We analysed our data using standard analysis-of-variance methodo- 
logy with fixed (farmer and non-farmer) effects and a random effect (clone) for all 
experimental assays. The data analysis was generated using SAS software (version 
9-1 of the SAS System for Microsoft Windows). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Isolation of wild D. discoideum strains. Isolation techniques followed published 
methods” with the exception that we collected soil samples of 20g or more, 
instead of 0.2 g, at each location. 

Culture conditions. We grew all wild isolates from spores on SM/5 agar plates (2 g 
glucose, 2g BactoPeptone (Oxoid), 2 g yeast extract (Oxoid), 0.2g¢ MgCl, 1.9g 
KHPO,, 1 g K,HPO, and 15 g agar per litre) in association with bacteria K. aerogenes 
or E. coli at room temperature. 

PCR amplification and sequence identification of novel bacterial isolates. We 
followed the procedures outlined in “Identifying unknown bacteria using bio- 
chemical and molecular methods” (http://www.nslc.wustl.edu/elgin/genomics/ 
Bio3055/IdUnknBacteria06.pdf) with one exception. Bacteria to be cloned and 
identified were grown on and collected from SM/5 agar plates. PCR fragments 
generated (using the above procedure) were sequenced at Lone Star Labs 
(Houston, Texas). We used the NCBI website http://www.ncbi.nlm.nih.gov/ 
sutils/genom_table.cgi as the search tool for sequences to identify bacteria to 
species. 

Data analyses. We analysed our data using standard analysis-of-variance methodo- 
logy with fixed (farmer and non-farmer) effects and a random effect (clone) for all 
experimental assays. The data analysis was generated using SAS software (version 
9-1 of the SAS System for Microsoft Windows). 

Experimental assays. Farmer test. We picked up the sorus contents of 6-12 
random fruiting bodies developed on SM/5 plates in association with E. coli or 
K. aerogenes from each wild clone to be tested using a filtered pipette tip. The sorus 
contents were spotted individually on SM/5 agar plates and assessed for bacterial 
growth after two days at room temperature. 

Reassociation assay. We used a population of four farmers and four non-farmers 
treated to eliminate all living, carried bacteria. To accomplish this, clones were 
grown on SM/5 agar plates containing antibiotics (0.1 g ampicillin, 0.3 g strep- 
tomycin sulphate per litre) using dead E. coli as a limited food source to allow the 
spores to hatch and develop. Dead E. coli was prepared by autoclaving a suspension 
of E. coli and KK2 (2.25 g KH2PO, and 0.67 g K,HPO, per litre H2O) for 20 min at 
121°C. After autoclaving, the absorbance of dead bacteria was set to Agog 6.0. 
Treated clones were allowed to form fruiting bodies, spores were collected and 
the process was repeated. We performed the farmer test after the second round and 
no bacterial growth was seen in the sorus contents of test clones even after ten days. 
We harvested spores from the bacteria-free, cured clones in KK2 and the spore 
density for each clone was determined by serial dilution using a haemacytometer 
under a light microscope. Spores from these treated clones were then reintroduced 
to live E. coli, plated on SM/5 agar plates, and allowed to form fruiting bodies. Ten 
random sori were collected individually from each clone and tested for bacterial 
growth following the farmer test above. 

Migration assay. We tested the slug migration ability of eight farmers and eight 
non-farmers per replicate, for two replicates. Duplicate plates were set up for each 
clone per replicate. For each plate, we prepared a slurry of 5 X 10° spores of one 
D. discoideum clone, K. aerogenes and KK2. We then applied the slurry to the edge 
of a 150 15mm? Petri plate containing 80ml nutrient-free agar (0.198 g 
KH>POu, 0.0356 g NagHPO, and 15g agar per litre), and allowed the slurry to 
dry. Each plate was wrapped in aluminium foil and a small pinhole was made 
directly opposite the spores and bacteria to provide directional light. The wrapped 
plates were placed in a lit incubator at 21°C for 84h to allow slugs to form and 
migrate. For each clone, we counted the distance travelled by slugs in each of five 
1.5-cm bands across the nutrient-free agar plate, and calculated the average dis- 
tance travelled for all slugs. 

Spore production assay. We tested two conditions: first, nutrients for bacterial 
growth but no outside bacteria were provided as food for the test clones; second, 
nutrients for bacterial growth as well as outside bacteria were provided as food for 
the test clones. For the first condition, we used four farmer clones and four non- 
farmer clones per replicate, with three replicates. We spread 200 ll of 10° spores 
plus dead E. coli in KK2 (absorbance, A¢goo 6.0) on 100 X 15 mm? $M/5 agar plates. 
Bacterial growth is possible for farmers in this condition. After development was 
complete, spores were collected by washing the plates with KK2 plus 0.1% NP-40 
(Calbiochem). The total number of spores produced by each clone was determined 
by counting using a haemacytometer and a light microscope. As a control to assess 
confounding growth differences between farmers and non-farmers, all eight 
clones were grown as above but the plates were supplemented with antibiotics 
(see “Reassociation assay”, above) to eliminate any potential food bacteria carried 
by the clones. No farming is possible in this set-up. 

For the second condition, we used five farmer clones and nine non-farmer 
clones per replicate, with three replicates. For the assay, we prepared nutrient-free 
agar plates (see “Migration assay”, above) laid with a grid of equidistant 13-mm 
AABP 04700 (Millipore) black filter squares. Filters were spotted individually with 
5X 10° spores in a slurry of live K. aerogenes and KK2 (Agoo 6.0). Clones were 
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spotted in an order determined by a random number generator. Duplicate samples 
were made for each clone for each experiment. Clones were allowed to hatch, grow 
and develop under direct light to limit potential movement of slugs before final 
culmination in fruiting bodies. Development was complete for all clones after four 
days. Each filter was collected and placed in a 1.5-ml conical Eppendorf tube 
containing 1.0 ml KK2 plus 0.1% NP-40 alternative. Tubes were vortexed briefly 
to disperse the spores evenly and counted as above without dilution to determine 
density. 

Soil assay. We collected soil from two separate locations in the Houston Arboretum 
and Nature Center (29° 46’ N, 95° 27’ W). Thin, non-nutrient agar plates were 
prepared to provide a humid environment for spores to hatch and to hold the soil 
in place. Soil was then layered to a ~2-3-mm depth in a half-moon pattern with an 
empty space at the centre line atop the starving agar on each plate (Supplementary 
Fig. 4a). We used 12 clones consisting of six farmers and six non-farmers. Farmer- 
non-farmer clones were randomly paired on each plate (one on each half-moon of 
soil) as a check for plate environment bias. Each pair was tested on two separate plates 
for each soil type. Data analysis was reported unpaired as no plate environment bias 
was detected. 

For the assay, fruiting body sori were collected from stock plates of all 12 clones 
previously prepared on the same day by plating 2 x 10° spores in association with 
K. aerogenes on SM/5 agar plates. The contents of an individual sorus for either a 
farmer or a non-farmer clone were picked up using a filtered pipette tip and placed 
on the unsterilized soil in the previously chosen locations (marked by coloured 
circles in Supplementary Fig. 4b). Three farmer sori and three non-farmer sori 
were placed on each experimental plate in this manner. After three days at room 
temperature under direct overhead light, all plates were viewed under a dissecting 
microscope to locate and collect all fruiting bodies formed. Fruiting bodies found 
were generally located in the same area as spotted on the soil. Whole fruiting 
bodies from all three spots per plate for either the farmer or the non-farmer were 
collected and placed together in an Eppendorf tube containing 1 ml KK2 plus 0.1% 
NP-40. Tubes were vortexed briefly to disperse the spores, and the spores were 
counted without dilution using a haemacytometer. The change in spore number 
for each clone was then calculated. To determine the initial number of spores 
spotted for each clone without diluting sorus contents, a proxy was used. The sorus 
contents from ten random fruiting bodies from each clone were collected and 
counted as above. The average count for these ten fruiting bodies was used to 
determine the clone’s average spore number per sorus spotted on the experimental 
plates. Additionally, we determined presence or absence of bacteria in the sori as 
well as bacteria identity for positive growth using a subset of ten clones (five 
farmers and five non-farmers) in both types of soil. We performed serial dilutions 
in KK2 of 80 sori contents (44 farmer and 34 non-farmer), and the clonal isolates 
recovered were used for sequencing following the methods described in “PCR 
amplification and sequence identification of novel bacterial isolates”, above. 

Bacteria usage assay. We collected spores from a population of 11 non-farmer 
and nine farmer clones. For each clone, we individually spotted four 30-11 spots of 
3 X 10* spores mixed with live K. aerogenes (Agog 3.0) on a SM/5 plate as well as 
spotting K. aerogenes alone as a control. Bacterial density was determined by using a 
sterilized inoculating loop to collect all growth from one spot for each clone ina 1.5-ml 
Eppendorf tube containing 1 ml KK2, vortexing to obtain a uniform suspension, 
removing hatched amoeba or spores by briefly centrifuging at 2,000g to pellet, and 
then determining the absorbance (Agoo) of the remaining bacteria. Data points were 
collected on days 1, 3,5 and 7. To determine whether confounding differences in spore 
germination occurred among clones, the number of hatched amoeba was determined 
for each clone on day 1 by counting using a haemacytometer. 

Proliferation assay. To determine vegetative doubling rates during exponential 
growth, we grew each clone separately by plating 1 X 10* log-phase cells per plate 
in association with K. aerogenes as a food source on replicated SM/5 agar plates. 
After 12h of growth, we collected all cells from a plate, diluted the cells in a 
measured amount of KK2 and counted the number of cells present using a hae- 
macytometer. We repeated this process for plates grown for 18, 24, 30, 36 and 42 h. 
We conducted the experiment in two temporally independent blocks. To analyse 
the data, we log-transformed the counts, determined the slope for each clone and 
performed a full-factor analysis of covariance. 

Phylogeny construction. DNA sequencing. We extracted DNA from spores using 
a Chelex/proteinase K extraction protocol. We amplified a non-coding region of 
the mitochondrial genome (mtDNA), regions of the nuclear ribosomal DNA 
(rDNA) and six variable fragments of nuclear DNA by PCR with the primers 
listed in Supplementary Table 3, using the following protocol (step 1: 2 min at 
94 °C; step 2: 30s at 94°C; step 3: 30s starting at 65 °C and decreasing by 1 °C per 
cycle; step 4: 1 min at 72 °C; step 5: 15 cycles to step 2; step 6: 30s at 94 °C; step 7: 
30s at 50°C; step 8: 1 min at 72 °C; step 9: 25 times to step 6; step 10: 6 min at 
72°C). We cleaned the PCR product with USB ExoSAP-IT and then sequenced 
using Perkin Elmer Applied Biosystems Big Dye 3.1 chemistry and a 3100 genetic 
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analyser. We analysed approximately 4,300 base pairs of the nucleotide sequence 
of nuclear 17S, 5.8S, 26S and 5S rDNA regions, approximately 800 base pairs of the 
nucleotide sequence of mtDNA (LSU intron) and approximately 3,700 base pairs 
of variable nuclear DNA from chromosomes 1, 2, 3 and 4. We aligned the 
sequences using the programs LASERGENE SEQMAN (version 7.0.0) and 
BIOEDIT (version 7.0.5.2). 

Data analysis. We used comparative DNA sequence data from 14 individual 
clones (QS numbers 1, 4, 6, 8-9, 11-12, 14-15, 17-18 and 21-23) to estimate gene 
trees/phylogenies and to estimate pairwise genetic distances between clones. These 
14 clones represent five farmers and nine non-farmers. We used Bayesian methods 
for phylogenetic reconstruction. Using MRBAYES* (version 3.1), we estimated a 
phylogeny for each data set based on the GTR+I+I model of molecular evolu- 
tion. In addition, two high-frequency, polymorphic indels (one in the mtDNA and 


one in the variable nuclear DNA) were scored as standard presence/absence 
characters and were included in the analysis with weighting equal to the nucleotide 
polymorphisms. For each analysis, four Metropolis-coupled Markov chains were 
run for 250,000 burn-in generations followed by 1.75 X 10° generations of data 
collection. We used the software program MEGA” (version 4) to estimate pairwise 
genetic distances between clones using the p-distance algorithm. We analysed 
sequence data from all 14 individual clones. Gaps and missing data were elimi- 
nated in pairwise sequence comparisons. 


31. Huelsenbeck, J. P. & Ronquist, F. MRBAYES: Bayesian inference of phylogenetic 
trees. Bioinformatics 17, 754-755 (2001). 

32. Tamura, K., Dudley, J., Nei, M.& Kumar, S. MEGA4: molecular evolutionary genetics 
analysis (MEGA) software version 4.0. Mol. Biol. Evol. 24, 1596-1599 (2007). 
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Preplay of future place cell sequences by 
hippocampal cellular assemblies 


George Dragoi' & Susumu Tonegawa! 


During spatial exploration, hippocampal neurons show a sequential 
firing pattern in which individual neurons fire specifically at par- 
ticular locations along the animal’s trajectory (place cells’”). 
According to the dominant model of hippocampal cell assembly 
activity, place cell firing order is established for the first time during 
exploration, to encode the spatial experience, and is subsequently 
replayed during rest*° or slow-wave sleep’'° for consolidation of 
the encoded experience'’*. Here we report that temporal sequences 
of firing of place cells expressed during a novel spatial experience 
occurred on a significant number of occasions during the resting or 
sleeping period preceding the experience. This phenomenon, which 
is called preplay, occurred in disjunction with sequences of replay of 
a familiar experience. These results suggest that internal neuronal 
dynamics during resting or sleep organize hippocampal cellular 
assemblies'** into temporal sequences that contribute to the 
encoding of a related novel experience occurring in the future. 

We recorded neuronal firing sequences from the CA1 area of the 
mouse hippocampus (Supplementary Fig. 1) during periods of awake 
rest (Fam-Rest) alternating with periods of running (Fam-Run) on a 
familiar track (Fam session; Supplementary Fig. 2a) that preceded the 
exploration of a novel linear arm in contiguity with the familiar track 
(Contig-Run on L-shaped track; Fig. 1, Supplementary Fig. 2a and 
Methods). All the place cells active on the novel arm during Contig- 
Run, whether previously silent'® (19% in both directions and 31% in at 
least one direction; Methods and Supplementary Tables 1-3) or active 
during Fam-Run (subpanels a in Fig. 1), fired during Fam-Rest at the 
ends of the familiar track (range, 0.17-11.7 Hz; Supplementary Fig. 3) 
as part of a number of ‘spiking events’. The spiking events were defined 
as epochs composed of multiple individual spikes from at least four 
different place cells active on the novel arm or familiar track, separated 
by less than 50 ms and flanked by at least 50 ms of silence**. More 
significantly, the temporal sequence in which the cells active on the 
novel arm fired during Fam-Rest (subpanels b in Fig. 1) was signifi- 
cantly correlated with the spatial sequence in which they fired later as 
place cells on the novel arm during Contig-Run (subpanels c in Fig. 1), 
despite being uncorrelated with their spatial sequence as place cells on 
the familiar track during Fam-Run. This is illustrated as place cell 
sequences during Contig-Run (subpanels c in Fig. 1) and Fam-Run 
(subpanels a in Fig. 1) compared with the firing sequences of these cells 
within individual spiking events observed during Fam-Rest (subpanels 
b in Fig. 1). We refer to this process as ‘preplay’ of place cell sequences 
because the temporal sequence of firing during Fam-Rest had occurred 
before the actual exploration of the novel arm in the subsequent 
Contig-Run and was not a replay of the place cell sequences from 
the previous Fam-Run. 

To quantify the significance of preplay and to compare it with replay, 
we created place cell sequence templates according to the spatial order 
of the peak firing of place cells**"° on the novel arm during Contig-Run 
(novel arm templates; subpanels c in Fig. 1 and Methods) and on the 
familiar track during Fam-Run (familiar track templates) for each run 
direction. The spikes of all the place cells used to construct the two types 


of template that were emitted during Fam-Rest were sorted by time, and 
spiking events were determined as explained above (subpanels b in 
Fig. 1). For each spiking event, we calculated a rank-order correlation 
between the novel arm templates and the temporal sequence of firing of 
the corresponding cells in the spiking events during Fam-Rest. The 
event correlation was considered significant if it exceeded the 97.5th 
percentile of a distribution of correlations resulting from randomly 
shuffling the order of place cells in the novel arm templates 200 times 
(P < 0.025). Forward* and reverse** preplay refers to the cases in which 
the sequence of place cells during Contig-Run and the firing order of 
the corresponding cells in Fam-Rest were in the same and opposite 
directions, respectively. In 91% of the preplay cases, the spiking events 
were correlated with the novel arm template in one direction only. The 
distribution of event correlation values obtained using the original 
novel arm templates was significantly shifted towards higher positive 
or negative values in comparison with the distribution of correlation 
values obtained using shuffled templates (Fig. 2a and Supplementary 
Fig. 4). Figure 2a also shows the distribution of significant preplay 
events (in red). Of all the spiking events detected as above and in which 
at least four novel arm place cells were active, 14.2% were significant 
preplay events for the place cell sequence on the novel arm (P< 10 ~*, 
binomial probability test*) in the forward or reverse order (Fig. 2b). 

The occurrence of significant preplay events was correlated with the 
occurrence of high-frequency ripple oscillations in CA1 (Fig. 2c). The 
majority of the significant preplay events (81.1%; Fig. 2d, total, blue) 
took place at the junction between the familiar and novel arms, and the 
remaining 18.9% took place at the free end of the familiar track 
(Fig. 2d, total, purple). The proportion of significant preplay events 
among the total events at each of the two track ends was higher at the 
junctional end (15.2%, P< 10 *°) than at the free end (8.5%, 
P< 10 *) of the familiar track (P < 0.035, Z-test; Fig. 2d, normalized). 

We found a relatively high correlation between the place field maps 
(Fig. 1A, B and Supplementary Fig. 5) of the familiar track before and 
after the novel experience (median r= 0.66; Fig. 2e, familiar track, 
blue); it was significantly higher than the correlations obtained when 
the cell identities were shuffled (median r = 0.23, P< 10 *; Fig. 2e, 
familiar track, black). A similar correlation analysis showed a relatively 
high stability of the newly formed place fields on the novel arm from 
the beginning to the end of Contig-Run (median r= 0.62 (newly 
formed) versus median r= 0.21 (shuffled), P< 10-?; Fig. 2e, novel 
arm, blue versus grey). These results suggest that preplay of the novel 
arm does not occur over an entirely new (that is, remapped) repre- 
sentation of the whole L-shaped track but rather benefits from the 
relative stability of the familiar track representation across sessions 
and perhaps facilitates the rapid, stable encoding of the novel arm 
experience. 

Using the familiar track templates and spiking events during Fam- 
Rest, constructed as above, we determined that 16.2% (P< 10 1. data 
not shown) were significant replay events**'” among the spiking events 
in which a minimum of four familiar track place cells were active. All 
significant preplay events occurring during Fam-Rest (n = 75) were 
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Figure 1 | Preplay of novel place cell sequences. Fam-Run and Fam-Rest 
respectively denote run and rest sessions on the familiar linear track before 
barrier removal; Contig-Run denotes run sessions on the L-shaped track after 
barrier removal. The L-shape track was linearized for display/analysis. 

A, B, mouse 1; C, D, mouse 2; E, mouse 3. A-E, a, Spatial activity on the familiar 
track during Fam-Run of the cells that had place fields in Contig-Run and 
preplayed during Fam-Rest (one cell per row); activity on the novel arm and 
familiar track are on the same scale. Horizontal arrows indicate run directions. 
Vertical grey bars indicate barrier locations during Fam-Run and Fam-Rest. 
A-E, b, Examples of representative spiking events in the forward or reverse 


tested for possible replay of the familiar track spatial sequence: these 
spiking events were more correlated with the novel arm template 
(Fig. 2f, red) than the familiar track template (Fig. 2f, blue). Seventy- 
two percent (n = 54) of the significant events previously considered to 
be preplay had no significant correlation with the familiar track tem- 
plate. An additional 16% (n = 12) of those events were better correlated 
with the novel arm templates (mean absolute r = 0.92) than with the 
familiar track template (mean absolute r = 0.67, P< 10 °). Together, 
these findings reject the hypothesis that the preplay events simply rep- 
resent a replay of the familiar track activity (see additional controls in 
Supplementary Information). Moreover, we found that the proportion 
of events exclusively composed of silent cells that perfectly matched the 
novel arm spatial templates was 0.67 (16 of 24 triplets), which is sig- 
nificantly greater (P < 0.025) than the proportion of by-chance perfect 
matches (0.33). 

To illustrate the distribution and relative proportions of preplay and 
replay events among all significant spiking events during Fam-Rest, we 
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direction during Fam-Rest in 250-ms time windows (350 ms for the second and 
fourth panels from left in E, b). Tick marks indicate individual spikes: red, 
preplay events for place cell sequences in the novel arm; blue (in A, b and 

B, b), additional spikes from the familiar track place cells participating in the 
spiking event (not shown in C-E, b). Numbers on the left denote cell numbers 
and correspond to the place cell numbers in A-E, a. Square boxes indicate the 
ends of the familiar track where preplay events occurred. Local field potentials 
recorded simultaneously with the spikes are shown above spiking events. 
A-E, c, Place cell sequences in the novel arm (C-E, c; red) or in both the novel 
arm (red) and the familiar arm (blue) (A, c and B, c) in Contig-Run. 


calculated a ‘template specificity index’ (Fig. 2g and Methods) for each 
event. Pure preplay events (Fig. 2g, red) and pure replay events (Fig. 2g, 
blue) were segregated, and only a minority of events were significant 
for both preplay and replay (Fig. 2g, yellow). Consistent with this 
segregation of preplay and replay events, the novel arm and the ‘cor- 
responding familiar track’ templates were not significantly correlated 
(Fig. 2h and Methods). The ratio between the number of pure replay 
events (n = 171) and the number of pure preplay events (n = 54) 
during Fam-Rest was about 3.1 (Fig. 2g, inset; see Supplementary 
Information for proportions of events). Preplay and replay events were 
distributed in time across Fam-Rest (Supplementary Fig. 6a—c) and 
their occurrences were generally uncorrelated (Supplementary Fig. 6d). 
The majority (79.9%) of the spiking events during Fam-Rest did not 
significantly correlate with either of the two templates (data not shown). 

We used a Bayesian reconstruction algorithm”**"*"” (Methods) to 
decode the animals’ position from the spiking activity during Fam- 
Run (Fig. 3a) or Fam-Rest (Fig. 3b, c). For all original and shuffled® 
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Figure 2 | Quantification of the preplay phenomenon and comparison with 
replay. a, Distribution of correlations between spiking events in Fam-Rest and 
spatial templates of the novel arm. Open bars indicate spiking events versus the 
original (unshuffled) templates; filled bars indicate spiking events versus 200 
shuffled templates scaled down 200 times; red bars show the distribution of 
preplay (that is, significant) events. Similar distributions (not shown) of 
corresponding spiking events were obtained when spatial templates were 
constructed using all place cells active on the L-shaped track (Figs 1A, b, c and 
1B, b, c; red and blue). b, Proportion of all, forward and reverse preplay events 
among the spiking events in Fam-Rest. The dotted line indicates the chance 
level (3.2%). ¢, Cross-correlation between preplay events and ripple epochs. 
d, Location of preplay events on the familiar track: total, proportions of preplay 
events at ends of the track; normalized, proportion of preplay events 
normalized by the number of spiking events at each end of track. Preplay events 
represented a trajectory running from the free end of the novel arm to the 
junctional end (40%) or begun near the familiar track (60%); the latter suggests 
that in some cases preplay events could be triggered by the activity of the 
familiar track place cells during Fam-Rest. e, Stability of place cell spatial tuning 
across the novel experience: familiar track, stability of the place fields active on 
the familiar track before (Fam-Run) versus after (Contig-Run) barrier removal; 
novel arm, stability of the place fields active on the novel arm at the beginning 


probability distributions, a line was fitted to the data using a line-finding 
algorithm® to represent the decoded virtual trajectory (Methods 
and Supplementary Information). In 16.36% of cases representing 
trajectories, the reconstructed trajectory during spiking events in Fam- 
Rest was contained within the novel arm (Fig. 3c, top), a place the 
animal had not yet visited (that is, trajectory preplay). Moreover, in 
79.8% of the trajectory preplay cases the shuffling procedures resulted 
in lines that were significantly less or not at all contained within the 
novel arm (that is, not preplay; Supplementary Information). The 
remaining trajectories decoded during Fam-Rest represented replay 
of the familiar track (64.15%; Fig. 3c, middle) or spanned the joint 
familiar track/novel arm space (19.49%; Fig. 3c, bottom). Means of 
absolute rank-order correlations between spiking activity and novel 
arm templates (Fig. 2a) restricted during epochs of trajectory preplay 
were significantly larger than those between spiking activity and familiar 
track templates calculated during the same epochs (0.75 versus 0.59, 
P<10 “). Overall, these results support the existence of the preplay 
phenomenon. 

To investigate the possibility that preplay of novel arm place cell 
sequences during Fam-Rest depends on the prior run experience on 
the familiar track, mice with no prior experience on any linear track 
were placed in a high-walled sleep box and recorded while resting/ 
sleeping. The animals were then transferred to a novel isolated linear 
track that was in the same room but could not be seen from inside the 


(first four laps of run) versus the end (last four laps) of the Contig-Run session. 
Data (blue), within-cell correlation of place cell spatial tuning for the 
corresponding track/arm; shuffle (black), cell identity shuffle (Supplementary 
Information). Error bars, s.e.m.; asterisks in d and e indicate significant 
differences. f, Distribution of preplay event correlations (red) versus 
distribution of these event correlations with the familiar track template (blue). 
Spiking events were detected using all place cells from the familiar track and 
novel arm templates (>1 Hz). Red bars are the same as in a. Correlation is 
strong with the novel arm template (preplay) and weak with the familiar arm 
template (replay). The P value corresponds to there being a significant 
difference between the two distributions. g, Disjunctive distribution of pure 
preplay (red), pure replay (blue) and preplay/replay (yellow) events during 
Fam-Rest over their template specificity index (Supplementary Information). 
Inset, proportions of pure preplay events (red), pure replay events (blue) and 
preplay/replay events (yellow) among all of the spiking events that were 
significantly correlated with at least familiar track templates or novel arm 
templates. h, Lack of correlation between the novel arm template and the 
corresponding familiar track template. Each of the six dots represents either a 
forward or a reverse run direction of one of the three mice analysed. Red 
horizontal line denotes a P value of 0.05. The correlation values were not 
significant in any of the cases (Supplementary Information). 


box, and the recording continued during de novo formation of place 
cells (Supplementary Fig. 2b, de novo session). We found that in a 
relatively large proportion (16.1%) of spiking events identified during 
sleep/rest in the sleep box, the neuronal firing sequences were signifi- 
cantly correlated with the place cell sequences observed during the first 
run session on the novel track (Fig. 4A, B and Methods); this was the 
case for all four individual mice (Supplementary Fig. 7). Preplay events 
were associated with the ripple occurrence (Fig. 4C). The place cells 
established on the novel track in the de novo session were more 
dynamic (median r = 0.42; Fig. 4D, blue) than in Contig-Run (median 
r= 0.62, P< 0.016; Fig. 2e, right, blue). 

We have demonstrated that a significant number of temporal firing 
sequences of CAI cells during resting periods of a familiar track 
exploration that preceded a novel track exploration in the same general 
environment were correlated with the place cell sequences of the novel 
track rather than the familiar track. This phenomenon, preplay, is 
temporally opposite to the process of replay*"°!°°, when activity dur- 
ing rest or sleep periods recapitulates place cell sequences that have 
already occurred during previous explorations. Preplay differs fun- 
damentally from replay because it occurs before exploration of novel 
tracks. 

Although our recordings were carried out in CA1, we believe that 
what we observed could be a reflection of the output of the recurrent 
cellular assemblies from upstream regions (CA3 or entorhinal cortex). 
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Figure 3 | Bayesian reconstruction of the animal’s trajectory in the familiar 
track (replay) and novel arm (preplay). a, Position reconstruction of a one-lap 
run on the familiar track from the ensemble place cell activity during Fam-Run. 
The heat map displays the reconstructed position of the animal using ensemble 
place cell activity during the run (250-ms bins; animal velocity, >5 cms‘). 
The yellow line indicates the actual trajectory of the animal during Fam-Run. 
b, Example of virtual trajectory reconstruction (familiar track and novel arm) 
from the ensemble place cell activity during Fam-Rest at the ends of the familiar 
track (20-ms bins; animal velocity, <5 cm s *) before barrier removal and 
novel arm exploration. The yellow line reflects the spatial location of the animal 
in time: the animal was immobile at the junction end of the familiar track. The 
time-compressed (~5 ms_') trajectory reconstruction often ‘jumps’ over the 
barrier (top of the figure) into the novel arm area. At around 0.5s, a preplay of 
the novel arm initiated from the distal (free) end of the novel arm ‘propagates’ 
towards the location of the animal. c, Examples of preplay of the novel arm 
(top), replay of the familiar track (middle) and preplay of the novel arm 
together with replay of the familiar track (bottom) during Fam-Rest. All 
conditions are the same as in b. The white line shows the linear fit maximizing 
the likelihood along the virtual trajectory. Colour bars indicate probability of 
trajectory reconstruction. 


During running on a familiar track, some of the cells in the postulated 
upstream cellular assemblies fire sequentially at spatial locations while 
others, although connected anatomically to these cells, remain silent. 
The lack of expression of preplay sequences during Fam-Run may 
reflect their state-dependent suppression or subthreshold activation 
during these exploratory behaviours. Owing to increased net excitation 
during rest periods predominantly during ripples*’, some of these 
silent cells together with some of the familiar track cells are activated 
above threshold and fire in a certain sequence. Their sequence of 
activation may be determined in part by their functional connectivity 
within the hippocampal formation network. Some of these sequences 
may in turn be activated on a novel track as place cell sequences 
(Supplementary Fig. 8). The activation of the novel place cell sequences 
during running may strengthen their pre-existing assembly organiza- 
tion manifested during preplay. 

It could be argued that during Contig-Run the animals simply con- 
sidered the novel arm to be an extension of the familiar arm and, thus, 
what we considered to be preplay events were replays of the previous 
runs on the familiar track. If this was the case, preplay events would not 
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Figure 4 | Preplay of novel place cell sequences before any linear track 
experience. A, Sleep/rest session in the sleep box (Pre-Run sleep/rest) before 
the first run session on a linear track (De novo-Run). Display format is the same 
as in Fig. 1. A, a, Representative spiking events in the forward or reverse order 
during Pre-Run sleep/rest in 400-ms time windows. A, b, Place cell sequences 
on the novel track (red) during the De novo-Run session. Each row represents 
one cell in which the activity was normalized to the maximum firing rate. One 
run direction in one animal is shown. The median number of place cells active 
on the novel track participating in preplay events is six. B, Distribution of 
spiking events in Pre-Run sleep/rest as a function of the rank-order correlation 
with the place cell sequence template of the novel track. Display format is the 
same as in Fig. 2a. C, Cross-correlation between preplay events and ripple 
epochs during Pre-Run sleep/rest. D, Stability of place cell spatial tuning across 
the novel track experience. Display format is the same as in Fig. 2e (novel arm). 
Error bars, s.e.m.; asterisk indicates significant difference. 


be expected to be found when the experience of the familiar track run is 
eliminated. This idea was refuted by the demonstration of frequent 
preplay events in the sleep box before the mice were transferred onto a 
novel linear track (de novo condition). Under this condition, the place 
cell sequences were more dynamic and a higher proportion of all 
spiking events correlated with the place cell sequences in these runs 
than in the later runs on novel linear tracks. These results suggest a 
shift in the relative contribution of internal*””’ and external drives in 
the formation of place cell sequences on encounter with a novel track. 
In the early phase, internal drives originating in the dynamic cellular 
assembly activities, which probably reflect numerous past experiences 
distinct from the current one and expressed as preplay, may have a 
greater role, whereas in the late phase, external drives that come from 
the specific set of stimuli of the current experience may dominate. 
Thus, place cell sequences on novel tracks seem to be products of a 
dynamic interplay between the internal and external drives. 

Several previous studies did not reveal preplay’*’°”. Although it is 
difficult to pinpoint the apparent discrepancies between these studies 
and the present one, we suggest that the use of insufficiently sensitive 
methods (pairwise correlations) by some studies’**° and small sample 
sizes by others’® might have precluded detection of preplay in previous 
work (see Supplementary Information for details). Data from the de 
novo condition (Fig. 4), in which we observed an even higher propor- 
tion of preplay events, have not been reported previously. 

Our data showed that novel preplay events coexist in disjunction 
with familiar replay events during the rest periods on the familiar 
track. This and the finding that these preplay and replay events 
together make up fewer than one-quarter of all detected spiking events 
suggest that they are part of a dynamic repertoire of temporal 
sequences in the hippocampus that are past-experience dependent 
(replay) or future-experience expectant™ (preplay). Post-experience 
replay of place cell sequences during resting*° or slow-wave sleep* '° 
has been proposed to have an important role in memory consolida- 
tion"’’*. The temporal preplay of new place cell sequences during 
resting or sleep is consistent with a predictive function for the hippo- 
campal formation® and may contribute to accelerating learning”® 
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when a new experience is introduced in multiple steps of increasing 
novelty. 


METHODS SUMMARY 


We recorded place cells from the CA1 area of the hippocampus with six indepen- 
dently movable tetrodes in four mice during sleep/rest sessions in the sleep box 
before any experience on linear tracks and during the first run session on a novel 
track. Following familiarization with the linear track, animals were subsequently 
allowed to explore a continuous (L-shaped) track in which the now familiar track 
and a new novel arm were made contiguous. To quantify the significance of the 
preplay and replay processes, spiking events in which at least four cells were active 
were detected during sleep/rest (speed, <1 cms ') periods in the sleep box or 
awake rest (speed, <2.cms_') periods at the ends of the familiar track and novel 
arm, predominantly during ripple epochs. 

We calculated statistical significance at the P<0.025 level for each event by 
comparing the rank-order correlation between the event sequence and the place 
cell sequence (template) with the distribution of correlation values from 200 tem- 
plates obtained by shuffling the original order of the place cells. Proportions of 
significant events were calculated as the ratio between the number of significant 
events and the total number of spiking events. We calculated the overall significance 
of preplay or replay processes by comparing the distribution of correlation values of 
all events with the distribution of correlation values of shuffled templates 
(Kolmogorov-Smirnov test). The significance of the proportion of significant events 
out of the total number of spiking events was determined as the binomial probability 
of observing the number of significant events (as successes) from the total number of 
spiking events (as independent trials), with a probability of success of 0.025 in any 
given trial. We reconstructed the position of the animal from the spiking activity 
emitted during resting periods using Bayesian decoding procedures’. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Surgery and experimental design. Electrophysiological recordings were per- 
formed on four C57BL/6 mice (strain NR1-floxed’’) with ages between 18 and 
22 weeks. All animals were implanted under Avertin anaesthesia with six inde- 
pendently movable tetrodes aiming for the CA1 area of the right hippocampus 
(1.5-2 mm posterior to bregma and 1-2 mm lateral to the midline; Supplementary 
Fig. 1). The reference electrode was implanted posterior to lambda over the cere- 
bellum. During the following week of recovery, the electrodes were advanced daily 
while animals rested in a small, walled sleeping box (12 x 20 cm?, 35cm high). The 
animal position was monitored by means of two infrared diodes attached to the 
headstage. 

The experimental apparatus consisted of a 90 X 65cm” rectangular, walled, 
linear track maze. All tracks were 4cm wide at the bottom and 8-9 cm wide at 
the top, and all linear track walls were 10cm high. Experimental sessions were 
conducted while the animals explored for chocolate sprinkle rewards placed 
always at the ends of the corresponding linear tracks (one sprinkle at each end 
of the track on each lap). Neuronal activity was recorded in naive animals (four 
mice) during the sleep/rest session in the sleep box immediately preceding the first 
experience on linear tracks, and continued (Fig. 4) during the first run session on a 
novel track. After familiarization with the linear track, the animals went through a 
recording session of 15-60 min (Fam session), and the recordings continued for 
the next 34-42 min (Contig session) while the animals explored an L-shaped track 
for the first time. In this track, the familiar arm and the novel arm were made 
contiguous by removing the barrier that had separated them (Fig. 1). For the 
purpose of analysing the recording data, the Fam session was further divided into 
Fam-Run, in which the animals ran through the track (velocity of animal’s move- 
ment was higher than 5 cms ~ 1) and Fam-Rest, where the animals took awake rests 
at the ends of the track (velocity of animal’s movement was less than 2 cm s'). 
During resting periods, the animals consumed the chocolate sprinkle and 
groomed, but mostly they were still until they self-initiated the next lap of run 
on the linear track. After completion of the experiments, the brains of all mice were 
perfused, fixed, sectioned and stained using nuclear fast red (Supplementary Fig. 1) 
or cresyl violet for electrode track reconstruction. 

Recordings and single-unit analysis. A total of 87 neurons were recorded from 
the CA1 area of the hippocampus in four mice during the Fam and Contig sessions 
(Supplementary Tables 1-3). A total of 69 CA1 neurons were recorded from the 
four mice in the de novo condition (26, 20, 10 and 13 cells, respectively). Single cells 
were identified and isolated using the manual clustering method Xclust* and the 
application of cluster quality measurements”*. Pyramidal cells were distinguished 
from interneurons on the basis of spike width, average rate and autocorrelations™. 

Place fields were computed as the ratio between the number of spikes and the 
time spent in 2-cm bins along the track, smoothed with a Gaussian kernel with a 
standard deviation of 2 cm. Bins where the animal spent a total of less than 0.1s 
and periods during which the animal’s velocity was below 5 cms | were excluded. 
Place field length and peak rate were calculated after separating the direction of 
movement and linearizing the trajectory of the animal. Linearized place fields were 
defined as areas with a localized increase in firing rate above 1 Hz for at least five 
contiguous bins (10 cm). The place field peak rate and location were given by the 
rate and location of the bin with the highest ratio between spike counts and time 
spent. Place field borders were defined as the points where the firing rate became 
less than 10% of the peak firing rate or 1 Hz (whichever was bigger) for at least 
2cm. 

Local field potential analysis. Ripple oscillations were detected during sleep/rest 
periods in the sleep box and during rest periods at the ends of the tracks. The 
electroencephalography signal was filtered (120-200 Hz) and ripple-band ampli- 
tude was computed using the Hilbert transform. Ripple epochs with maximal 
amplitude more than 5 s.d. above the mean, beginning and ending at 1 s.d. were 
detected. The time of ripple occurrence (Figs 2c and 4C) was the time of its 
maximal amplitude. The proportion of ripples with which cells with place fields 
on the novel arm of the L-shaped track fired in the preceding session 
(Supplementary Fig. 3) was calculated for each qualifying cell as the ratio between 
the number of ripples during which the cell fired at least one spike and the total 
number of ripples during the corresponding exploratory session. 

Preplay and replay analyses. To analyse the preplay and replay processes, spiking 
events were detected during Pre-Run sleep/rest periods in the sleep box (de novo 
condition; velocity, <1cms ') or during awake rest periods at the ends of the 
running tracks (Contig condition; velocity, <2cms~'). A spiking event was 
defined as a transient increase in the firing activity of a population of at least four 
different place cells within a temporal window preceded and followed by at least 
50 ms of silence. Overall, similar results were obtained using 50-, 60-,75- and 100-ms 
time windows. The spikes of all the place cells active on the novel track that were 
emitted during the Pre-Run sleep/rest in the box for the de novo condition as well as 
the spikes of all the place cells active on the familiar track or the novel arm that were 


emitted during Fam-Rest session at the two ends of the familiar track for Contig 
condition were respectively sorted by time and further used for the detection of the 
spiking events. 

All four animals exhibited a significant number of spiking events in the Pre-Run 
session of the de novo condition. Three of the four animals (mice 1-3) exhibited a 
significant number of spiking events in the Contig condition, the remaining animal 
(mouse 4) having a below-threshold number of simultaneously active CA1 place 
cells. The time of the spiking event used to compute the cross-correlation with 
ripple epoch occurrence (Figs 2c and 4C) was the average time of all spikes com- 
prising the spiking event. The place cell sequences (templates) were calculated for 
each direction of the animal’s movement and for each run session (De novo-Run, 
Fam-Run and Contig-Run) by ordering the spatial location of the place field peaks 
that were above 1 Hz. For place cells with multiple place fields above 1 Hz on a 
particular arm or track in the Contig condition (six of 52 place cells active on the 
novel arm in the two directions, or 12%: two for each direction in mouse 1, one in 
mouse 2 and one in mouse 3), only the place field corresponding to the peak firing 
rate of the place cell on that arm or track was considered for the construction of the 
template of that particular arm or track, to be consistent with all the previous 
studies that used spatial templates to demonstrate replay during sleep or awake 
rest**°, Place cells with fields on both the novel arm in the Contig-Run session and 
the familiar track in the Fam-Run session participated in the construction of both 
the novel arm and familiar track templates. 

Statistical significance was calculated for each event by comparing the rank- 
order correlation between the sequence of cells’ firing during the event (that is, 
event sequence) and the place cell sequence (template), on the one hand, and the 
distribution of correlation values between the event sequence and 200 surrogate 
templates obtained by shuffling the order of place cells, on the other* (Fig. 2a). The 
significance level was set at 0.025 to control for multiple comparisons (two direc- 
tions of run). The proportions of significant events (preplay novel track, preplay 
novel arm (Fig. 2b), replay novel arm and replay familiar track) were each calcu- 
lated as the ratio between the number of significant events and the total number of 
spiking events in which at least four corresponding place cells were active’. 
Corresponding familiar track templates (Fig. 2h) were constructed by ordering 
the location of peak firing on the familiar track during Fam-Run (no minimum 
threshold of firing) of all place cells that subsequently fired on the novel arm. Cells 
comprising the corresponding familiar track templates are the same as those 
comprising the novel arm templates. We note that these corresponding familiar 
track templates are different from the ones used in Figs 1 and 2a-g, which were 
constructed by ordering the peak firing of all place cells active on the familiar track 
>1Hz. 

The overall significance of the preplay (Fig. 2a) or replay process was calculated 
by comparing the distribution of correlation values of all events relative to the 
original template with the distribution of correlation values relative to the shuffled 
surrogate templates, using the Kolmogorov-Smirnov test*. Quantification of the 
replay versus preplay events during the Fam-Run session (Fig. 2f, g) was per- 
formed as described above using different spatial templates for the familiar track 
and the novel arm. All spiking events were correlated with both the novel arm and 
the familiar track templates. Events significantly correlated only with familiar 
track or with novel arm templates were considered pure replay and pure preplay, 
respectively. The template specificity index was calculated for each event as the 
difference between the absolute value of the event’s correlation with the novel arm 
template (preplay, high positive index) and the event’s correlation with the famil- 
iar track template (replay, high negative index). For the purpose of displaying the 
template specificity index, events correlated with the novel arm but not with the 
familiar track templates were considered preplay and events correlated with the 
familiar track but not with the novel arm templates were considered replay 
(Fig. 2g). Additionally, events correlated with both the familiar track and the novel 
arm templates formed a third group, preplay/replay events, displayed in yellow in 
the inset of Fig. 2g. 

Correlations between pairs of familiar track and novel arm templates (Fig. 2h) 
were performed using modified familiar track templates that were constructed 
using the location of peak firing (>0 Hz) of only those cells that had place fields on 
the novel arm (peak rate, >1 Hz). The lack of significant correlation in this case 
demonstrates that the novel arm place cell sequence is not simply a transposition 
of a familiar track place cell sequence on the novel arm. 

We also identified neurons that did not fire during Fam-Run, that activated 
during Fam-Rest events and that corresponded to trajectories on the novel arm 
during Contig-Run (silent cells). We calculated the correlation between the order 
in which they fired during Fam-Rest events and their spatial sequence as new place 
cells on the novel arm during Contig-Run, as previously explained. Owing to the 
low absolute number of silent neurons, only triplets of cells were available for 
further analysis (n = 24). The proportion of events perfectly matching the spatial 
template was compared with the proportion of by-chance perfect matching (0.33). 
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Stability of place cell maps. Stabilities of place cell firing on the familiar track 
before and after barrier removal as well as on the novel track (de novo condition) 
and the novel arm (Contig condition) in the beginning versus the end of the run 
session were assessed by calculating, for each place cell and each direction, a 
correlation between the spatial firing in the corresponding paired situations 
(before versus after barrier removal for the familiar track or the first four laps 
versus the last four laps of the De novo-Run or Contig-Run session for the novel 
track or arm, respectively). The place cell activity was not partitioned in place 
fields; rather, the whole activity on the particular track or arm was considered 
separately for each cell and direction (average correlations are shown in Figs 2e 
and 4D, blue bars). In addition, we calculated the same type of correlation after 
shuffling the identity of the cell in one member of the correlation (once for each 
different cell; average correlations are in Figs 2e and 4D, black bars). Shuffle results 
(Figs 2e and 4D, black bars) were computed as correlation between spatial tuning 
of cells on the familiar track during Fam-Run and spatial tuning of all other 
simultaneously recorded cells on the familiar arm during Contig-Run (familiar 
track group; Fig. 2e, left), or correlation between spatial tuning of cells on the novel 
arm (or novel track) during the beginning of Contig-Run (or De novo-Run) and 
spatial tuning of all the other simultaneously recorded cells on the novel arm (or 
novel track) during the end of Contig-Run (novel arm group; Fig. 2e, right) or De 
novo-Run (Fig. 4D). Original and shuffled correlations were compared using the 
rank-sum test. The average number of laps (traversal of the novel track in both 
directions) per session was 20.5 in De novo-Run (21, 16, 27 and 18 in the four mice) 
and 16.3 in Contig-Run (13, 14 and 22 in the three mice). 

Bayesian reconstruction of actual and virtual trajectories. For each cell, we 
calculated a linearized spatial tuning curve on the familiar track during the 
Fam-Run session and a linearized spatial tuning curve on the novel arm during 
the Contig-Run session. The tuning curves were constructed in 2-cm bins from 
spikes emitted in both run directions at velocities higher than 5cms~ 1 and were 
smoothed with a Gaussian kernel with a standard deviation of 2cm. We con- 
structed a joint spatial tuning curve for each cell by juxtaposing the spatial tuning 
curve on the familiar track during the Fam-Run session and the spatial tuning 
curve on the novel arm during the Contig-Run session. We also detected for each 
cell all the spiking activity emitted at velocities below 5cms | during the Fam- 
Rest session, where replay and preplay events where shown to occur using the 
rank-order correlation method. We used a Bayesian reconstruction algorithm®'* 
to decode the virtual position of the animal from the spiking activity during Fam- 
Rest (Fig. 3b) in non-overlapping, 20-ms bins using the joint spatial tuning curves. 
We then extracted epochs of reconstructed trajectory matching the time of the 
spiking events as detected using multiunit activity of place cells from the familiar 
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track and novel arm (rank-order correlation method; see ‘Preplay and replay 
analyses’, above). 

We used two shuffling procedures to measure the quality of the Bayesian 
decoding. In the first shuffling procedure, for each event, the original time-bin 
columns of the probability distribution function (PDF) were replaced with an 
equal number of time-bin columns randomly extracted from a pool containing 
the time-bin columns of all PDFs of all detected events®. The shuffling procedure 
was repeated 500 times. In the second shuffling procedure, the identity of the place 
cells was randomly shuffled 100 times and new PDFs were calculated for all events. 
For all original and shuffled PDFs, a line was fitted to the data using a previously 
described line-finding algorithm’. Lines fitted to the original and shuffled data 
were compared using slope, spatial extent, location on the track and probability 
score. We defined replay and preplay as the epochs of Fam-Rest in which the 
reconstructed trajectory was located on the familiar track or the novel arm, 
respectively. The trajectory was defined across a set of position estimates during 
the corresponding epoch (Fig. 3c). Only epochs that lasted at least 60 ms (three 
bins) and which contained reconstructed trajectories spanning at least 10 cm were 
considered for further analysis. Trajectories for which 75% or more of their length 
was located on the familiar track were considered to represent replay of an animal’s 
trajectory on the familiar track (Fig. 3c, middle), and trajectories for which 75% or 
more of their length was located in the novel arm were considered to represent 
preplay of the animal’s future trajectory on the novel arm (Fig. 3c, top). The 
remaining events were considered preplay-replay (Fig. 3c, bottom). 

An epoch was considered significant if the new line was less than 75% contained 
in the familiar track for replay or novel arm for preplay in at least 95% of the 
shuffled cases. For each epoch that was significant for replay or preplay using the 
reconstruction method, we retrieved the value of the rank-order correlation 
between the neuronal firing sequences and the familiar track and novel arm spatial 
templates as calculated using the rank-order correlation method. We compared 
the absolute correlation values between the epoch’s firing sequences and familiar 
track templates with the absolute correlation values between the same epoch’s 
firing sequences and novel arm templates. We also reconstructed the trajectory of 
the animal on the familiar track from the spiking activity during the Fam-Run 
session at velocities above 5 cms‘ in 250-ms bins using the spatial tuning curves 
on the familiar track*’* (Fig. 3a) to validate the decoding procedure. 
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Development of asymmetric inhibition underlying 
direction selectivity in the retina 


Wei Wei!, Aaron M. Hamby’, Kaili Zhou! & Marla B. Feller’? 


Establishing precise synaptic connections is crucial to the develop- 
ment of functional neural circuits. The direction-selective circuit 
in the retina relies upon highly selective wiring of inhibitory inputs 
from starburst amacrine cells’ (SACs) onto four subtypes of ON- 
OFF direction-selective ganglion cells (DSGCs), each preferring 
motion in one of four cardinal directions’. It has been reported 
in rabbit that the SACs on the ‘null’ sides of DSGCs form func- 
tional GABA (y-aminobutyric acid)-mediated synapses, whereas 
those on the preferred sides do not*®. However, it is not known 
how the asymmetric wiring between SACs and DSGCs is estab- 
lished during development. Here we report that in transgenic mice 
with cell-type-specific labelling, the synaptic connections from 
SACs to DSGCs were of equal strength during the first postnatal 
week, regardless of whether the SAC was located on the preferred or 
null side of the DSGC. However, by the end of the second postnatal 
week, the strength of the synapses made from SACs on the null side 
of a DSGC significantly increased whereas those made from SACs 
located on the preferred side remained constant. Blocking retinal 
activity by intraocular injections of muscimol or gabazine during 
this period did not alter the development of direction selectivity. 
Hence, the asymmetric inhibition between the SACs and DSGCs is 
achieved by a developmental program that specifically strengthens 
the GABA-mediated inputs from SACs located on the null side, ina 
manner not dependent on neural activity. 

The ability to detect motion in the visual scene is a fundamental 
computation in the visual system that is first performed in the retina. 
Motion direction is encoded by DSGCs, which fire a maximum num- 
ber of action potentials during movement in their preferred direction, 
but fire minimally for movement in the opposite, or null, direction*®. 
In the mammalian retina, the directional preference of an ON-OFF 
DSGC is caused by asymmetric inhibitory inputs: movement in the 
null direction causes strong inhibition that effectively shunts light- 
evoked excitatory inputs. Indeed, blocking GABA, receptors abolishes 
the directionality of DSGCs by increasing spiking in response to null- 
direction motion®*. Null-side inhibition is thought to arise from SACs 
because their processes cofasciculate with DSGC dendrites”"®, where 
they form direct GABAergic synapses’, and because ablation of SACs 
eliminates the directional preference of DSGCs""””. 

How SAC-DSGC synapses are organized to provide asymmetric 
inhibition has been an intriguing but difficult question because no 
apparent asymmetry is detected in the morphology or the distribution 
of synaptic markers in DSGCs and SACs'*’. The first and only piece of 
evidence for the synaptic basis of asymmetric inhibition came from a 
functional study between SAC and DSGC pairs in rabbit retina’, which 
suggested that SACs on the null side provide inhibitory inputs to the 
DSGCs but that those on the preferred side do not. Whether this asym- 
metric inhibition exists in the mouse is not known. In addition, because 
the directional preference of an ON-OFF DSGC is present by eye open- 
ing’** and the identification of DSGCs and their preferred directions is 
almost impossible before the onset of the light response, little is known 
about the developmental program that shapes the SAC-DSGC synapses. 


Here we use paired recordings and morphological reconstructions 
from a double-transgenic mouse line that selectively expresses two var- 
iants of green fluorescent protein (GFP) in SACs and nasal-preferring 
ON-OFF DSGCs (nDSGCs) to characterize the organization and the 
development of the precise wiring between SACs and DSGCs. These 
mice were generated by crossing two existing lines: Drd4-GFP mice, 
where Drd4 promoter-driven GFP expression is restricted to nDSGCs"”, 
and mGluR2-GFP mice (mGluR2 also known as Grm2), where a 
membrane-tethered human interleukin-2%/GFP fusion protein is 
expressed specifically in SACs in the retina (Fig. 1a)”°. 

To detect functional GABAergic synapses between SACs and DSGCs, 
we performed targeted whole-cell voltage-clamp recordings from SAC- 
DSGC pairs in whole-mount retinas. To isolate GABAergic synapses, 
paired recordings were carried out in the presence of drugs that block 
excitatory synaptic transmission (Fig. 1b). Alexa dyes were included in 
the recording pipettes to visualize the dendritic morphology of the 
recorded pairs (Fig. 1c). Only pairs with overlapping dendritic fields 
were used for analysis. 

Paired recordings were carried out in postnatal-day-4 (P4), P7, P14 
and adult mice. At P4, GABAergic currents elicited by SAC depolariza- 
tion were detected in nDSGCs in 64% of pairs (16 of 25 pairs; Fig. 1b, d), 
indicating that synapse formation between SACs and nDSGCs 
occurred before and during the first postnatal week, confirming pre- 
vious findings'®. By P7, nearly all pairs showed unitary GABAergic 
connections (P7: 85%, 29 of 34 pairs), and this high level of connectivity 
persisted into adulthood (P14-48: 91%, 41 of 45 pairs; Fig. 1d). The 
evoked response was completely blocked by the GABA, receptor 
antagonist gabazine (5M, n = 4; data not shown), indicating that 
the GABAergic transmission between SACs and nDSGCs is mediated 
by GABA, receptors. We note that the finding that connections were 
readily detected between SACs located on the preferred side of DSGCs 
in adult mice is in contrast to previous findings in rabbit’. 

Though SACs located on both the preferred side and the null side 
formed GABAergic synapses with DSGCs, a significant asymmetry in 
the unitary synaptic strength emerged along the null-preferred axis 
during the second postnatal week. Synaptic strength was quantified 
as the GABA,-receptor-mediated whole-cell conductance. These 
measurements were restricted to the null-side and preferred-side pairs 
that had similar amounts of overlap between SAC processes and 
DSGC dendrites. Unexpectedly, at P4 and P7 the GABAergic conduc- 
tances from both groups were similar (Fig. 2). However, a significant 
increase in unitary conductance was detected in the null-side pairs but 
not in the preferred side pairs in retinas at P14 and older (Fig. 2). 
Hence, the establishment of the direction-selective circuits is mediated 
by an asymmetric increase in the strength of the unitary conductance 
between SACs and DSGCs in the week before eye opening. 

The difference in GABAergic conductance from the null- and 
preferred-side SACs prompted us to examine two possibilities regarding 
the mechanisms underlying this strengthening. First we tested whether 
this functional asymmetry was correlated with the number or quality of 
contacts between SACs and nDSGCs, indicating a preferential adhesion 
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Figure 1 | nDSGCs receive direct GABAergic inputs from SACs located on 
the null and the preferred side from P4 until adult. a, Fluorescence image of 
the ganglion cell layer from a P30 Drd4-GFP/mGluR2-GFP mouse, showing 
the bright membrane-bound GFP expressed under the mGluR2 promoter in 
the SACs and the dim cytoplasmic GFP driven by the Drd4 promoter in the 
nDSGC. Scale bar, 25 tm. b, Paired whole-cell voltage-clamp recordings of 
GABAergic currents in a P4 nDSGC (lower traces) evoked by depolarization of 
a SAC from the null side (upper traces) in the presence of the NMDA (N- 
methyl-p-aspartate) receptor antagonist D(—)-2-amino-5-phosphonovaleric 
acid (AP5), «-amino-3-hydroxy-5-methyl-4-isoxazole propionic acid 
(AMPA)/kainate receptor antagonist 6,7-dinitroquinoxaline-2,3-dione 
(DNQX) and «4-containing nicotinic acetylcholine receptor antagonist 
dihydro-B-erythroidine (DHBE). SACs were depolarized from —60 to 0 mV, 


between SACs located on the null side and DSGCs’, which is an import- 
ant mechanism for dendritic differentiation and synaptogenesis in other 
systems*'”’. After electrophysiological recording, the dendritic arbori- 
zations of the synaptically connected, Alexa-dye-filled SAC-nDSGC 
pairs from P14 to P48 were imaged live with a two-photon microscope 
and reconstructed using NEUROLUCIDA (Fig. 3a). We examined the 
overlapping region between the nDSGC dendrites and the distal portion 
(roughly the outer third) of the SAC processes enriched in varicosities, 
which are the sites of neurotransmitter release’. Crossing points 
between distal SAC processes and nDSGC dendrites were defined as 
‘contacts’ (Fig. 3a, inset). A subset of contacts exhibited cofascicula- 
tion’’°**, which were defined as 2-l1m segments along which the pro- 
cesses from the two cells remained in contact (Fig. 3a, inset). The 
null- and preferred-side SAC-nDSGC pairs showed a similar density 
of contacts (Fig. 3b) and cofasciculations (Fig. 3c). No asymmetry was 
found when all of the SAC processes were included in the above analysis 
(Supplementary Fig. 1). Therefore, the functional asymmetry in 
GABAergic synapses does not involve selective adhesion between 
null-side SAC processes and DSGC dendrites™. 

The second possibility we tested was whether spontaneous retinal 
activity during the second postnatal week has a role in the establishment 


which reliably evoked an inward current in SACs. The postsynaptic GABAergic 
currents were recorded in DSGCs at different holding potentials to determine 
the current-voltage relationship of the conductance. c, Example images of 
synaptically connected, dye-filled SAC-DSGC pairs at P4, P7 and P30. The left- 
hand side shows pairs with SACs (green) located on the null side of the DSGCs 
(red). The right-hand side shows preferred-side pairs. Scale bar, 50 um. d, Soma 
locations of the GABAergically connected SAC-nDSGC pairs along the null- 
preferred axis during development. Red spots represent the positions of DSGC 
cell bodies. The positions of SAC cell bodies that form GABAergic synapses 
with their respective nDSGCs are shown as green spots; the SAC cell bodies that 
were not connected to nDSGCs are shown as grey spots. All pairs had 
overlapping dendritic fields. Scale bar, 25 jum. 


of direction selectivity. DSGCs are depolarized by retinal waves, and 
activity could therefore potentially influence the synapse strengthen- 
ing**. To this end, we first confirmed that the GFP-labelled nDSGCs in 
the Drd4-GFP mice showed a clear preference for nasal motion at eye 
opening (Fig. 4a) that was sensitive to the GABA, receptor antagonist 
gabazine (Supplementary Fig. 2), with a direction selectivity index 
similar to those recorded in the adult (Fig. 4b). We then injected 
muscimol, a GABA, receptor agonist, intravitreally into Drd4-GFP 
mice to block all spontaneous and evoked neural activity in the retina”. 
In the presence of muscimol, evoked synaptic transmission from SACs 
to nDSGCs and spontaneous activity in both cell types were completely 
suppressed (Fig. 4c and Supplementary Fig. 3a, b). The effectiveness 
of muscimol injection at blocking activity in vivo was confirmed by 
examining eye-specific segregation of retinogeniculate projections, 
which is an independent measure of retinal activity (Supplementary 
Fig. 3c, d), and the persistence of fluorescently labelled muscimol in 
the retina at 48h post-injection (Supplementary Fig. 3e). 

Weassessed the responses of nDSGCs to stationary flashes and drift- 
ing gratings in P14—15 mice that had received repeated muscimol injec- 
tions in the second postnatal week. Muscimol treatment did not prevent 
the development of direction-selective responses or significantly reduce 
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Figure 2 | GABAergic conductance in the null-side SAC-nDSGC pairs 
strengthens during the second postnatal week. a, Postsynaptic GABAergic 
currents in nDSGCs recorded at holding potentials between —70 and -10 mV in 
response to depolarization (as in Fig. 1b) of null-side (left) and preferred-side 
(right) SACs at P4, P7, P14 and P30. b, Relative soma positions of SAC-nDSGC 
pairs used for conductance analysis at P4, P7 and P14-48. Open circles 
represent nDSGC cell bodies. Filled circles are SAC somas colour-coded for 
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conductance strength normalized to the maximum value across all ages. 
Dashed lines illustrate average dendritic arborization diameter, centred on the 
asterisks, for nDSGCs (white) and SACs (red; asterisks represent average soma 
locations). Scale bar, 25 um. c, Summary plot of GABAergic conductances of 
the null- and preferred-side SAC-nDSGC pairs at P4, P7 and P14-48. 
Individual pairs (black) and mean + s.d. (red) are shown. One-way analysis of 
variance: P< 0.0001; t-test: *P < 0.0001, **P = 0.0003. 
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Figure 3 | Dendritic contacts and cofasciculations between SACs and 
nDSGCs occur at similar densities for the null- and preferred-side pairs. 
a, NEUROLUCIDA reconstructions of the dendrites from the on sublamina 
and side views of the complete dendritic arborizations from a null-side (left) 
anda preferred-side (right) pair of SACs and nDSGCs. Dots represent dendritic 
contacts, with cofasciculation segments coloured white and the rest coloured 
purple. The GABAergic conductances for the null- and preferred-side pairs are 
indicated. Scale bar, 25 um. Inset, fluorescence image of the outlined region 
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showing crossing contacts (arrows) and cofasciculation (arrowhead). Scale bar, 
5 um. b, Summary plot of the density of total contacts between DSGCs and 
distal SAC processes (roughly the outer third) from the null or preferred side 
from P14 to P48. Individual pairs and mean = s.d. are shown. The data points 
for P28 and later are coloured blue, and the ones for before P28 are coloured 
black. c, Summary plot of the density of cofasciculations between nDSGCs and 
distal SAC processes from the same pairs as in b. Null- and preferred-side 
groups are not significantly different in b and c. P > 0.7, t-test. 
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Figure 4 | Intraocular injections of muscimol or gabazine do not alter 
direction selectivity in nDSGCs. a, The normalized spike vector sums of 
nDSGCs in response to drifting gratings of 12 directions from P14—15 Drd4- 
GFP mice that received either no treatment (control) or intraocular injections 
of saline, muscimol or gabazine from P6 to P12. D, dorsal; N, nasal; T, temporal; 
V, ventral. The red lines are mean vector sums of all cells in each group. Insets, 
examples of normalized tuning curves of single cells, with corresponding vector 
sums represented as red lines of nDSGCs from each group. Control: n = 4 mice, 
12 cells; saline: n = 11 mice, 43 cells; muscimol: n = 12 mice, 25 cells; gabazine: 
n= 4 mice, 17 cells. b, Summary plot of direction selectivity index (DSI) for 


directional tuning of nDSGCs (Fig. 4a, b). Normal ON and OFF light 
responses were also present in the muscimol-treated group, although 
there was an increase in the number of cells that did not respond to 
gratings in the muscimol-treated group (Supplementary Fig. 4). 

In visual cortex, activation of GABA, receptors is required for 
maturation of GABAergic synapses”. To test the hypothesis that 
GABA, receptor activation is required for the development of direction 
selectivity, we performed intravitreal injections of the GABA, receptor 
antagonist gabazine into Drd4-GFP mice during the second postnatal 
week. Gabazine treatment did not prevent the development of 
direction-selective responses of GFP-positive cells to drifting gratings 
(Fig. 4a, b). Therefore, the development of direction selectivity arises 
independently of the activation of GABA, receptors. 

To begin exploring the synaptic basis of this increase in conductance 
between null-side SACs and DSGCs, we recorded the spontaneous 
inhibitory postsynaptic currents (IPSCs) from nDSGCs at P7 and 
P14. We found a significant increase in the frequency and no signifi- 
cant change in the amplitude of the GABAergic IPSCs, although there 
was a trend towards larger IPSC amplitudes at P14 (Supplementary 
Fig. 5). This result is consistent with the hypothesis that the stronger 
GABAergic unitary conductance for the null-side pairs is primarily 
due to increased numbers of functional GABAergic synapses. 
However, we cannot tell whether the spontaneous IPSCs originate 
from the null- or preferred-side SACs. Further study is required to 
determine the relative role increases in synapse number versus synapse 
strength have in the increase in the unitary conductance between null- 
side SACs and DSGCs. 

Our study demonstrates that asymmetric inhibition arises during 
the second postnatal week through selective strengthening of the 
GABAergic conductance from SACs on the null sides of DSGCs. 
Morphological analysis revealed a similar degree of dendritic contact 
and cofasciculation between SACs on the null or preferred side, indi- 
cating that the synapse development is dissociated from physical 
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adult (>P28), P14-15 untreated, saline, muscimol and gabazine-treated 
groups. Bars show mean + s.e.; open circles represent individual cells. Adult 
data are reproduced from ref. 19. c, Example traces from whole-cell voltage- 
clamp recordings of inhibitory (upper traces, Vy = 0 mV) and excitatory 
(lower traces, Vy = —50 mV) currents from a P14 nDSGC in drug-free 
artificial cerebrospinal fluid (control, left) or artificial cerebrospinal fluid 
containing 100 1M muscimol (right). Deflections from baseline correspond to 
spontaneous synaptic currents. At depolarized potentials, application of 
muscimol activated a tonic current, which was measured as a change in the 
baseline holding current”. 


encounters between SAC processes and DSGC dendrites, as was 
recently found in barrel cortex”®. 

In addition, we found that blocking depolarization-induced activity 
or GABA, receptor activation did not affect the establishment of 
direction selectivity in the retina, in sharp contrast to direction-selective 
cells in the visual cortex”’. This finding lends support to previous studies 
showing that early visual experience’*”* or cholinergic retinal waves’* 
are not involved in establishing retinal direction selectivity. Therefore, 
the mechanism underlying the development of retinal direction selec- 
tivity is an asymmetric increase in the strength of the inhibitory unitary 
conductance between SACs and DSGCs in the week before eye opening, 
without the establishment of asymmetrical dendritic contacts and inde- 
pendent of spontaneous neural activity. 


METHODS SUMMARY 


We performed dual whole-cell voltage-clamp recordings from SAC-nDSGC pairs in 
oxygenated artificial cerebrospinal fluid at 32-34 °C containing 119.0mM NaCl, 
26.2mM NaHCO;, 11mM glucose, 2.55mM KCl, 1.0mM K,HPO,, 2.5mM 
CaCl, 1.3mM MgCh, 0.05mM AP5, 0.02mM DNQX and 0.008mM DHBE. 
Recording electrodes of 3-5 MQ were filled with an internal solution containing 
110 mM CsMeSQ,, 2.8 mM NaCl, 4mM EGTA, 5mM TEA-Cl, 4mM adenosine 
5'-triphosphate (magnesium salt), 0.3 mM guanosine 5'-triphosphate (trisodium 
salt), 20mM HEPES and 10mM phosphocreatine (disodium salt), 0.025mM 
Alexa 488 (for SACs) and 0.025 mM Alexa 594 (for nDSGCs), pH 7.2. Data were 
acquired using PCLAMP 10 recording software and a Multiclamp 700A amplifier 
(Molecular Devices). The GABAergic conductance was calculated from the linear 
portion of the current-voltage curve for the SAC-evoked currents in nDSGCs. After 
recording, we imaged the dye-filled SACs and the nDSGCs using a custom-modified 
two-photon microscope as described previously’ (FluoView 300, Olympus 
America) at 745 nm. Images were acquired at z intervals of 0.5 jum using a X60 
objective (Olympus LUMPlanFI/IR X60/0.90W). SAC and nDSGC processes were 
reconstructed from image stacks with NEUROLUCIDA. For in vivo injections, we 
anaesthetized animals with 3.5% isoflurane/2% Op. The eyelid was then opened with 
fine forceps, and 1 pl of 10 mM muscimol, 500 uM gabazine or saline was injected 
using a fine glass micropipette. Injections were made with a picospritzer (World 
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Precision Instruments) generating 20-p.s.i., 3-ms-long positive pressure. We 
repeated this procedure every 48 h, starting at P6 and ending at P12. Whole-mount 
retina preparation and two-photon targeted recording for light responses was per- 
formed according to previously described techniques*. The direction selectivity 
index was computed as previously described’. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Mice. Drd4-GFP mice in the Swiss Webster background were obtained from 
MMRRC” (http://www.mmrrc.org/strains/231/0231.html), and mGluR2-GFP 
mice were a gift from Shigatada Nakanishi, Osaka. Both strains were backcrossed 
to the C57BL/6 background in our laboratory. The Drd4-GFP/mGluR2-GFP 
double-transgenic mice were obtained by crossing the two single-transgenic lines. 
Whole-cell patch-clamp recording. Single or dual whole-cell voltage-clamp record- 
ings from SACs and nDSGCs were performed in oxygenated artificial cerebrospinal 
fluid at 32-34 °C containing 119.0mM NaCl, 26.2mM NaHCOs, 11 mM glucose, 
2.5mM KCl, 1.0mM K,HPO,, 2.5mM CaCh, 13mM MgCh, 0.05mM AP5, 
0.02 mM DNQX and 0.008mM DHBE. Recording electrodes of 3-5 MQ were 
filled with an internal solution containing 110mM CsMeSQu,, 2.8mM NaCl, 
4mM EGTA, 5 mM TEA-Cl, 4 mM adenosine 5'-triphosphate (magnesium salt), 
0.3 mM guanosine 5’-triphosphate (trisodium salt), 20 mM HEPES and 10 mM 
phosphocreatine (disodium salt), 0.025 mM Alexa 488 (for SACs) and 0.025 mM 
Alexa 594 (for nDSGCs), pH 7.25. Data were acquired using PCLAMP 10 record- 
ing software and a Multiclamp 700A amplifier (Molecular Devices), filtered at 
4kHz and digitized at a sampling rate of 10 kHz. The GABAergic whole-cell 
conductance was calculated from the linear portion of the current-voltage curve 
for the SAC-evoked currents in nDSGCs and analysed using MATLAB software. 
Two-photon targeted loose-patch recording of GFP-positive neurons for light 
response. Drd4-GFP mice were anaesthetized with isofluorane and decapitated in 
accordance with the UC Berkeley Institutional Animal Care and Use Committee 
and in conformance with the NIH Guide for the Care and Use of Laboratory 
Animals, the Public Health Service Policy and the SFN Policy on the Use of 
Animals in Neuroscience Research. Under infrared illumination, retinas were iso- 
lated from the pigment epithelium in oxygenated Ames’ medium (Sigma), cut into 
dorsal and ventral halves, and mounted over a hole of 1-1.5 mm” on filter paper 
(Millipore) with the photoreceptor layer facing down. Retinas were kept in darkness 
at 25°C in Ames’ medium bubbled with 95% O2/5% CO2 until use (0-7h). 
Recording electrodes of 3-5 MQ were filled with Ames’ medium. GFP fluorescence 
was detected with a custom-built, FluoView-based two-photon microscope and a 
Ti:sapphire laser (Coherent) tuned to 920 nm, a wavelength that minimally acti- 
vates mouse photoreceptors and therefore preserves light response. GFP cells were 
then targeted for loose-patch recordings using PCLAMP 10 recording software and 
a Multiclamp 700A amplifier. 
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Visual stimuli were generated as previously described’. Briefly, a white, mono- 
chromatic organic light-emitting display (OLEDXL, eMagin; 800 x 600 pixel 
resolution, 85-Hz refresh rate) was controlled by an Intel Core Duo computer 
with a Windows XP operating system. Drifting square-wave gratings (spatial 
frequency, 225m per cycle; temporal frequency, 4cycless '; 30°s ' in 12 
pseudorandomly chosen directions spaced at 30 intervals, with each presentation 
lasting 3 s and followed by 500 ms of grey screen) were generated from the OLED 
using MATLAB and the Psychophysics Toolbox, and were projected through the 
X60 water-immersion objective (LUMPlanFl/IR, NA 0.9) via the side port of the 
microscope, centred on the soma of the recorded cell and focused on the photo- 
receptor layer. Loose-patch recordings were obtained during the stimulus pre- 
sentation and analysed using MATLAB. A detailed, step-by-step protocol of the 
two-photon targeted recording of light response can be found in ref. 30. 
Two-photon microscopy and morphological reconstruction. After paired 
recording, the Alexa-488-filled SACs and the Alexa-594-filled nDSGCs in the 
Drd4-GFP/mGluR2-GFP mice were imaged using the two-photon microscope 
at 745 nm. At this wavelength, GFP is not efficiently excited but both Alexa 488 
and Alexa 594 are brightly fluorescent. Therefore, the morphology of the Alexa- 
488-filled SACs could be distinguished from the very weak GFP fluorescence. 
Image stacks were acquired at z intervals of 0.5 um and resampled three times 
for each stack using a X60 objective (Olympus LUMPlanFl/IR X60/0.90W), 
covering the entire dendritic fields of the SACs and nDSGCs. Image stacks from 
25 SAC-nDSGC pairs were then imported into NEUROLUCIDA (MBF 
Biosciences) and reconstructed in three dimensions. The densities of contacts 
and cofasciculations were measured from the reconstructions. 

Intraocular injections. Drd4-GFP animals were anaesthetized with 3.5% isoflurane/ 
2% Op. The eyelid was then opened with fine forceps, and 1 pl of 10 mM muscimol 
(Tocris), 500 1M gabazine (Tocris) or saline was injected using a fine glass micro- 
pipette. Injections were made with a picospritzer (World Precision Instruments) 
generating 20-p.s.i., 3-ms-long positive pressure. To prevent efflux of the injected 
solution, removal of the pipette tip from the eye was done slowly and gentle pressure 
was then applied to the injection site with a sterile cotton swab for ~10s. This 
procedure was repeated every 48 h, starting at P6 and ending at P12. 

Statistical analysis. Grouped data are presented as mean ~ s.d. or s.e.m. as indi- 
cated. Data sets were tested for normality, and statistical differences were exam- 
ined using one-way analysis of variance and post hoc comparisons using Student’s 
t-test with Bonferroni corrections (MATLAB). 
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Spatially asymmetric reorganization of inhibition 
establishes a motion-sensitive circuit 


Keisuke Yonehara', Kamill Balint', Masaharu Noda?*, Georg Nagel‘, Ernst Bamberg®° & Botond Roska! 


Spatial asymmetries in neural connectivity have an important role 
in creating basic building blocks of neuronal processing’”. A key 
circuit module of directionally selective (DS) retinal ganglion cells 
is a spatially asymmetric inhibitory input from starburst amacrine 
cells*°. It is not known how and when this circuit asymmetry is 
established during development. Here we photostimulate mouse 
starburst cells targeted with channelrhodopsin-2 (refs 6-8) while 
recording from a single genetically labelled type of DS cell”’°. We 
follow the spatial distribution of synaptic strengths between 
starburst and DS cells during early postnatal development before 
these neurons can respond to a physiological light stimulus, and 
confirm connectivity by monosynaptically restricted trans-synaptic 
rabies viral tracing. We show that asymmetry develops rapidly over 
a 2-day period through an intermediate state in which random 
or symmetric synaptic connections have been established. The 
development of asymmetry involves the spatially selective reorgani- 
zation of inhibitory synaptic inputs. Intriguingly, the spatial 
distribution of excitatory synaptic inputs from starburst cells is 
significantly more symmetric than that of the inhibitory inputs 
at the end of this developmental period. Our work demonstrates 
a rapid developmental switch from a symmetric to asymmetric 
input distribution for inhibition in the neural circuit of a principal 
cell. 

DS retinal ganglion cells respond to movement in a ‘preferred’ 
direction with robust spiking, but show minimal response to move- 
ment in the opposite, or ‘null’, direction®’’'’. DS cells receive 
GABAergic inhibitory inputs from starburst amacrine cell processes 
pointing in the null direction, but not from those pointing in the 
preferred direction*”*. Glutamatergic excitatory input from bipolar 
cells is also directionally selective. Interestingly, starburst cells also 
communicate to DS cells using acetylcholine’*’’, but this excitatory 
connection seems to be symmetric’. Directional selectivity is present 
before eye opening (around postnatal day 13 (P13) in mice), as well as 
in dark-reared animals”'”-””, indicating that the establishment of cir- 
cuit asymmetry does not require visual experience. How and when 
such highly specific synaptic connections are established between star- 
burst and DS cells during development remain unknown. Retinal cells 
do not respond to light until P10-11 in mice’*”, with the exception of 
melanopsin-containing ganglion cells’, which limits the ability to 
follow the early development of functional connectivity. Directional 
selectivity may develop by the asymmetric refinement of previously 
formed inhibitory connections (Supplementary Fig. la) or, alterna- 
tively, the inhibitory synaptic inputs form asymmetrically (Supplemen- 
tary Fig. 1b). 

To distinguish between these possibilities, we probed the spatial dis- 
tribution of synaptic strengths from starburst amacrine cells to individual 
ON DS cells during postnatal development. ON DS cells respond to slow 
movement and are critical for mediating the optokinetic reflex”*. In 
SPIG1-GFP (SPIG1, also known as Fstl4, locus driving green fluorescent 


protein expression) knock-in mice, upward-motion-preferring ON DS 
cells are selectively labelled with GFP throughout development in most 
retinal regions”"®. 

To activate the starburst cells of SPIG1-GFP mice before amacrine and 
ganglion cells receive light-driven inputs from bipolar cells, this mouse 
line was crossed with another line expressing Cre recombinase speci- 
fically in starburst cells (choline acetyltransferase (ChAT)-Cre knock- 
in mice)”. At PO, we transduced these SPIGI-GFP X ChAT-Cre mice 
with a Cre-recombinase-dependent adeno-associated virus (AAV) car- 
rying a reversed and double-floxed C128T mutant channelrhodopsin-2 
(ChR2c)** followed by 2A-DsRed2 (ChR2c-2A-DsRed2, see Methods, 
Fig. 1a). 2A sequence codes for a cis-acting hydrolase element” that 
creates equimolar amounts of ChR2c and red-fluorescent, soluble 
DsRed2. A soluble marker in the cell body allowed easier quantification 
of fluorescence, and therefore ChR2 expression, than in a membrane- 
bound fusion construct. ChR2c-expressing cells are responsive to light at 
an intensity 50-fold lower than cells expressing wild-type ChR2 (ref. 7) 
and could, therefore, be activated by light patterns generated by an over- 
head projector. 

Immunohistochemistry showed that all DsRed2-marked neurons 
were also positive for ChAT, a marker for starburst cells. Conversely, a 
substantial fraction (~60%) of starburst cells in both the ganglion cell 
layer (GCL) and inner nuclear layer (INL) (Fig. 1b, c) were DsRed2- 
labelled. Therefore, starburst cells, but no other cell type in these mice, 
are labelled red and express light-sensitive ChR2c, whereas upward- 
motion-preferring ON DS cells are labelled green (Fig. le). First, we 
characterized the light-excitability of ChR2c-positive starburst cells in 
intact, isolated retinas between P6 and P9. Light illumination evoked 
robust currents in DsRed2-expressing cells, even in the presence of 
glutamatergic synaptic blockers (CPP, NBQX, APB), suggesting 
ChR2c as the source of the currents (Fig. 1d). Increasing illumination 
evoked increasing membrane potential changes in starburst cells and, 
as expected due to the 2A element, the red fluorescence intensity of the 
recorded cell bodies correlated well (R = 0.83) with the magnitude of 
the membrane potential change at the stimulation intensity which is 
used to test the distribution of synaptic strengths in subsequent experi- 
ments (Supplementary Fig. 2). 

To test whether the genetically tagged neural circuit could report the 
synaptic strengths from starburst to ON DS cells, we isolated excitatory 
and inhibitory inputs to ON DS cells at P8 while stimulating ChR2c- 
expressing starburst cells with light patterns (see Methods). A full-field 
light step elicited both inhibitory and excitatory currents in ON DS cells 
(Fig. 1f and Supplementary Fig. 3). The inhibitory component was 
blocked by the GABA receptor antagonist picrotoxin, and the excitatory 
input by the cholinergic receptor antagonist curare. Blocking glutamate 
receptors had no effect on the light-evoked currents (Fig. 1f) or on the 
miniature excitatory postsynaptic currents (mEPSC, Supplementary 
Fig. 4). mEPSCs were blocked by curare (Supplementary Fig. 4). 
These results confirmed that ON DS cells receive GABAergic and 
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Figure 1 | Targeting of ChR2c to starburst amacrine cells at P8. a, AAV 
vector. EFla, promoter; ITR, inverted terminal repeat; WPRE, woodchuck 
post-transcriptional regulatory element. b, Confocal images from an AAV- 
transduced retina. c, Relationship between DsRed2-expressing and ChAT- 
positive cells. d, Top, excitatory currents inan AAV-labelled starburst cell in the 
presence of synaptic blockers. Full-field flash stimulus. Bottom, confocal image 
of the recorded starburst cell. e, Top, confocal image ofa retina in which ON DS 
cells are expressing GFP (green) and starburst amacrine cells are expressing 
ChR2c and DsRed2 (magenta). Bottom, side-view. IPL, inner plexiform layer. 
f, Synaptic currents recorded at —60 mV (red) and 20 mV (blue) holding 
potentials from a GFP-positive ON DS cell in response to a full-field flash. Error 
bars, s.d. 


cholinergic synaptic inputs in response to starburst cell stimulation, but 
do not receive glutamatergic synaptic input from bipolar cells at this 
stage. 

The ChR2c-assisted synaptic strength mapping depends on direct 
connections between starburst and ON DS cells during early postnatal 
development. To test whether this is the case, we performed mono- 
synaptically restricted retrograde synaptic tracing with G-deleted rabies 
virus’’** complemented with G-expressing herpes virus initiated from 
GFP-labelled ON DS cells (see Methods, Supplementary Figs 5 and 6). 
At P6, starburst cells were rabies-labelled around infected GFP-marked 
ON DS cells (Fig. 2), indicating that starburst cells are directly con- 
nected to ON DS cells at this developmental stage. 

Having confirmed monosynaptic connection from starburst cells 
already at P6, we investigated the spatial distribution of the strength 
of synaptic connections by stimulating starburst cells with light steps in 
eight sectors surrounding the recorded ON DS cells (Fig. 3). We calcu- 
lated a spatial asymmetry index (SAI) that quantified the degree of 
spatial asymmetry of the synaptic inputs to ON DS cells along the 
dorso-ventral axis (see Methods). We found that the inhibitory input 
was already spatially asymmetric along the dorso-ventral axis by P8; 
stimulation of the ventral (null) side evoked more inhibitory current 
than stimulation of the dorsal (preferred) side. In contrast, the excitatory 
input was significantly more symmetric along the same axis (Fig. 3 and 
Supplementary Fig. 7). To avoid potential bias due to non-uniform viral 
transduction we normalized the synaptic currents with either the num- 
ber of DsRed2-expressing starburst cells (using a threshold) or the sum 
of the measured red fluorescence (which reflects the voltage change in 
starburst cells, as shown before, Supplementary Fig. 2) of the starburst 
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Figure 2 | Monosynaptically restricted circuit mapping initiated from ON 
DS cells. a, b, Live images of SPIG1-GFP retinas at P6 in which GFP-labelled 
ON DS cells (green) were infected with G-deleted rabies expressing mCherry 
(magenta) either alone (a) or in combination with G-encoding herpes virus 
(b). c, Confocal images of a GFP-labelled ON DS cell (cyan) infected with 
G-deleted rabies virus only (magenta). d, Confocal images of an ON DS cell 
infected with both G-deleted rabies virus and G-encoding herpes virus from 
b. Most of the labelled presynaptic cells (arrows) are ChAT-positive starburst 
cells (yellow) in the GCL. 


cells in each of the eight sectors in which the light stimulus was pre- 
sented (see Methods). The normalized responses, like the recorded raw 
responses, also showed asymmetric inhibition and more symmetric 
excitation along the dorso-ventral axis at P8 (Fig. 3, Supplementary 
Figs 7 and 8). In contrast to P8, the raw and normalized inhibition 
and excitation at P6 was symmetric along the same axis (Fig. 3, Sup- 
plementary Figs 7 and 8). The lack of asymmetry in inhibition at P6 was 
not due to ineffective activation of starburst cells because of low ChR2c 
expression level, since half-maximal activation at P9, which should be 
similar to maximal activation at P6 in terms of eliciting changes of 
membrane potential (Supplementary Fig. 2), revealed asymmetry (Sup- 
plementary Fig. 9) . 

Next, we investigated the emergence of asymmetry from P6 to P9 
(Fig. 4). SAI of inhibition increased significantly, but there was no 
significant change in excitation between any pairs of days (Supplemen- 
tary Fig. 8, note: the lack of stars between pairs of conditions on any of 
the figures means no significant change). The lack of statistically sig- 
nificant change in excitation was not due to saturating intensities 
because half-maximal activation of excitatory inputs to ON DS cells 
did not significantly change the SAI of excitation at P9 (Supplementary 
Fig. 9). The mean direction of inhibitory input of individual recorded 
cells, computed as the vector sum of inputs for all eight directions, was 
random at P6 but became confined to the ventral side by P8 (direction 
of red bars in Fig. 4h). Because the variation in DsRed2 expression 
across the eight sectors was not statistically different between P6 and 
P9 (Supplementary Fig. 10), the randomness at P6 is not due to greater 
variation in gene expression from AAV at earlier time points. We con- 
clude that before P6 the spatial distribution of inhibitory connectivity 
between starburst cells and ON DS cells is either random or symmetric 
(with some synapses having little strength). Inhibitory connectivity 
rapidly reorganizes to become asymmetric along the ‘preferred-null’ 
(dorso-ventral) axis between P6 and P8, whereas excitatory cholinergic 
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Figure 3 | ChR2c-assisted circuit mapping at P8 and P6. a-f, Recordings 
from ON DS cells at P8 (a-c) and P6 (d-f). Inhibitory (a, d) and excitatory 
(b, e) postsynaptic currents elicited in an ON DS cell by the stimulation of eight 


input remains significantly more symmetric throughout this develop- 
mental period (Supplementary Fig. 1a). 

Is inhibitory connectivity strengthened at the ventral (‘null’) and 
weakened at the dorsal (‘preferred’) side or is only one of these two 
mechanisms driving the development of asymmetry? Since starburst 
cells similarly control the strength of cholinergic excitation and 
GABAergic inhibition to ON DS cells (Supplementary Fig. 11) and 
excitation is not significantly different along the dorso-ventral axis 
from P6 to P9, the ratio of inhibition to excitation (neither normalized) 
in the dorsal and ventral sides should be a measure of the inhibitory 
synaptic strength that depends less on the level of ChR2c expression 
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Figure 4 | Development of asymmetry. Coloured lines indicate normalized 
current responses from individual cells and the black line indicates the mean 
response of all recorded cells (or the mean cell number for a). Polar plot of the 
number of DsRed2-expressing cells (a), excitatory (b) and inhibitory (c) inputs, 
excitatory (d) and inhibitory (e) inputs normalized to the number of DsRed2 


sectors surrounding the cell. Polar plots are also shown. c, f, Spatial asymmetry 
index (SAI) for inhibition and excitation. g, Sketch of light patterns used to 
stimulate one of eight sectors around the recorded ON DS cell. Error bars, s.d. 


(which increases over the days, Supplementary Fig. 2d) than inhibition 
alone. This ratio increased in the ventral (though the increase was not 
significant) and decreased in the dorsal side (Supplementary Fig. 12), 
suggesting that ‘push-pull’ synapse reorganization is at work. 

The retinal stratum in which ON DS cells extend their dendrites 
embodies three different directionally selective computations that lead 
to preferential responses to nasal, upward and downward motion in 
different types of ON DS cells. We suggest that, in the physical space 
shared by these circuits, it is the spatially selective refinement of the 
distribution of inhibitory input strength to each DS ganglion cell that 
underlies the establishment of each directionally selective retinal circuit. 
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expressing cells in each sector, excitatory (f) and inhibitory (g) inputs 
normalized to the mean DsRed2 fluorescence intensity of cells in each sector. 
h, Red bars indicate the vector sum of inhibitory inputs normalized to the 
number of DsRed2 expressing cells in each sector. 
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Higher-order brain computations, for example orientation selectivity in 
the visual cortex, also rely on spatial circuit asymmetries. Mechanistic 
insights from the development of retinal directional selectivity may help 
to understand how asymmetry in cortical circuits is established. 


METHODS SUMMARY 


On the day of birth (PO) starburst amacrine cells in the progeny of a cross between 
mice expressing Cre in starburst amacrine cells and SPIG1-GFP mice expressing 
GFP in ON DS cells were labelled with ChR2c in vivo by transduction with a Cre- 
recombinase-dependent AAV. Retinas were isolated at P6 or later and GFP- 
labelled ON DS cells were recorded in voltage clamp at —60 mV (for excitation) 
or 20 mV (for inhibition)” guided by a two-photon microscope at 930 nm (ref. 30). 
Photostimulation was performed with white light steps (duration 2 s, inter-stimulus 
interval 10 s) generated by a digital light projector (PLUS Vision) and directed onto 
each of eight sectors surrounding the recorded cell. Stereotaxic injections of rabies 
and herpes viruses to the medial terminal nucleus (MTN) of SPIGI-GFP mice were 
performed at P1 and infected retinas were isolated at P6. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Animals. ChAT-Cre mice were purchased from Jackson Laboratory (strain: 
B6;12986-Chat!”™!(""])_ In ChAT-Cre mice, Cre recombinase is expressed 
under the control of the ChAT locus. In SPIGI-GFP mice GFP is expressed under 
the control of the SPIG1 locus””®. To obtain neonatal SPIG1-GFP x ChAT-Cre 
mice we crossed SPIGI-GFP homozygous mice with ChAT-Cre homozygous 
mice. All animal procedures were performed in accordance with standard ethical 
guidelines (European Communities Guidelines on the Care and Use of Laboratory 
Animals, 86/609/EEC) and were approved by the Veterinary Department of the 
Canton of Basel-Stadt, Switzerland. 

AAV plasmids. In the present paper, we refer to the C128T mutant ChR2 (refs 
6-8) as ChR2c. To obtain pAAV-EFla-double floxed-ChR2c-2A-DsRed2- 
WPRE-hGHpA we linearized pAAV-EFla-double floxed-hChR2(H134R)- 
EYFP-WPRE-hGHpA (provided by K. Deisseroth) using Nhel/AscI. ChR2c was 
PCR-amplified from pGEMHE-ChR2c using a HindIII-2A-covering primer. 
Forward primer: 5’-GCTAGCGCTAGCCACCATGGATTATGGAGGCGCCC 
TG-3’. Reverse primer: 5’-TCTCCCGCAAGCTTAAGAAGGTCAAAATTCIT 
GCCGGTGCCCTTGTTGAC-3’. DsRed2 was PCR-amplified from pDsRed2-N1 
(Clontech) using a HindIII-2A-covering primer. Forward primer: 5’-ACCTTC 
TTAAGCTTGCGGGAGACGTCGAGTCCAACCCTGGGCCCATGGCCTCCT 
CCGAGAACGTC-3’. Reverse primer: 5’-GGCGCGCCGGCGCGCCCTATC 
ACAGGAACAGGTGGTGGCG-3’. These two PCR products were then digested 
with Nhel, AscI and HindIII and triple ligation was performed. 

AAV production. Serotype 7 recombinant AAVs were made by Penn Vector 
Core. Penn Vector Core performed the genome copy (GC) number titration (titre: 
5.78 10'GC per ml) using real-time PCR (TaqMan reagents, Applied 
Biosystems). 

Logic of AAV labelling. We used the viral vector AAV EFla-double floxed- 
ChR2c-2A-DsRed? to target the expression of ChR2c to starburst cells expressing 
Cre. In AAV EFla-double floxed-ChR2c-2A-DsRed2, two incompatible loxP 
variants*', loxP and lox2722, flank an inverted version of ChR2c followed by the 
red fluorescent marker DsRed2. In the presence of Cre, a stochastic recombination 
of either loxP variant takes place, resulting in the inversion of ChR2c-2A—DsRed2 
into the sense direction, followed by expression of the ChR2c-2A-DsRed2. The 2A 
element*” codes for a cis-acting hydrolase element” that creates equimolar 
amounts of ChR2c and red-fluorescent DsRed2. 

AAV injection. We injected the virus at the day of birth. SPIGI-GFP  ChAT- 
Cre mice were anesthetized with crushed ice. Virus (1.5 ul, 8.68 X 10° GC) was 
loaded into pulled glass pipettes (tip diameter, 30 1m) and injected into the vitreal 
space of both eyes using a microinjector (Narishige). After 6 days, DsRed2 expres- 
sion was brightly detectable; hence all recordings were performed on retinas at P6 
or later. 

Preparation of retinas. Neonatal mice were killed by decapitation. Eyes were 
enucleated. The retinas were isolated and the pigment epithelium removed in 
Ringer’s medium (in mM: 110 NaCl, 2.5 KCI, 1 CaCh, 1.6 MgCh, 10 p-glucose, 
22 NaHCOs, bubbled with 5% CO2/95% O2, pH 7.4) and mounted ganglion-cell- 
side up on a filter (MF-membrane, Millipore) with a 2-mm rectangular aperture in 
the centre. Before starting superfusion, DsRed2-positive regions together with 
GFP-positive cells were located with an epifluorescence stereomicroscope 
(Olympus) and photographed for later determination of the average DsRed2 
fluorescence intensity around recorded ON DS cells for data normalization and 
orientation of the retina. Only GFP cells surrounded by DsRed2 expression were 
chosen for the recordings. The orientation of the isolated SPIGI-GFP X ChAT- 
Cre retina was determined by the pattern of GFP expression. In most retinal 
regions, GFP is expressed exclusively in one type of ON DS cells during the 
developmental period. An exception is the dorsal (slightly temporal) region, where 
GFP is expressed in many different ganglion and amacrine cell types. A thick axon 
bundle in this region runs in the dorso-ventral direction towards the optic disk and 
can be used as a compass in isolated retinas’. The retinas were superfused in 
Ringer’s medium at 35-36°C in the microscope chamber for the duration of 
the experiment. In this retinal preparation, ChR2c-mediated light responses could 
be measured for more than 8h. 

Two-photon imaging, electrophysiology and pharmacology. Fluorescent cells 
were found with the help of a two-photon microscope equipped with a Mai Tai HP 
two-photon laser (930 or 1,010 nm) (Spectra Physics) integrated into the electro- 
physiological setup*’. Current recordings were made in whole-cell voltage clamp 
mode using an Axon Multiclamp 700B amplifier with borosilicate glass electrodes 
(BF100-50-10, Sutter Instruments) pulled to 7-9 MQ, and filled with (in mM) 
112.5 CsCH3SO3, 1 MgSO, 7.8 X 10 *CaCh, 0.5 BAPTA, 10 HEPES, 4 ATP-Nap, 
0.5 GTP-Na3, 5 lidocaine N-ethylbromide (Qx314-Br), 7.75 neurobiotin chloride, 
pH7.2. Excitatory and inhibitory synaptic currents (‘excitation’ and ‘inhibition’, 
respectively) were separated by voltage-clamping the cell to the equilibrium potential 
of chloride (—60 mV) and unselective cation channels (20 mV), respectively”. For 
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recording mEPSCs, cells were voltage-clamped at —60mV and recorded for 
3-5 min. Voltage recordings from DsRed2-positive starburst cells were made in 
whole-cell current clamp mode with glass electrodes pulled to 7-9 MQ and filled 
with (in mM) 115 K-gluconate, 9.7 KCl, 1 MgCl, 0.5 CaCl, 1.5 EGTA, 10 HEPES, 4 
ATP-Na,, 0.5 GTP-Na3, pH 7.2. In pharmacological experiments, agents were 
bath-applied at the following concentrations: 10 4M CPP ((+)-3-(2-carboxypiperazin- 
4-yl) propyl-1-phosphonic acid, blocking NMDA receptors), 101M NBQX 
(6-nitro-2,3-dioxo-1,4-dihydrobenzo|f] quinoxaline-7-sulfonamide, blocking AMPA 
and kainate receptors), 10 11M APB (L-(+)-2-amino-4-phosphonobutyric acid, 
blocking metabotropic glutamate receptors and therefore blocking the ON 
pathway), 50 uM curare (tubocurarine chloride, blocking nicotinic acetylcholine 
receptors), 100 uM picrotoxin (blocking GABA A and C receptors). All chemicals 
were obtained from Sigma, with the exception of APB (Calbiochem). Data were 
analysed offline with Mathematica (Wolfram Research). 

Photostimulation. ChR2c was activated with light generated by a digital light 
projector (V-332, PLUS Vision). The stimulation was generated via custom-made 
software (Matlab, Mathworks; Labview, National Instruments). Light intensity 
was measured using a spectrometer (USB 4000, Ocean Optics) calibrated with a 
reference source (LS1-Cal, Ocean Optics). Light intensity was modulated by using 
different grey scales (0-255) combined with different neutral density filters (NDO, 
ND10, ND20 and ND 30). To correlate the stimulus intensity and the change in 
membrane potential in ChR2c-expressing starburst cells (Supplementary Fig. 2), 
we used full-field flash at 24 different intensities (12.08-15.86 in Log intensity 
photons cm~* s~') presented for 2 s with an inter-stimulus interval of 5 s. For the 
stimulation of the eight sectors (Figs 3, 4), each stimulus was presented for 2 s with 
an inter-stimulus interval of 10s. The eight sectors were stimulated in a random 
order. The light pattern was focused on the GCL. To find a ‘weak light intensity’ 
that evoked half the maximum excitatory current input to ON DS cell (Sup- 
plementary Fig. 9), full-field flash at 24 different intensities (12.08-15.86 in Log 
intensity photons cm * s” ') was presented sequentially (presentation for 2 s with 
an inter-stimulus interval of 5 s) initially and next the retinas were then stimulated 
in eight sectors using the determined light intensity. 

Data collection and analysis. Light stimulation of each of the eight sectors (Fig. 3g) 
around the recorded ON DS cell body was repeated in each recorded ON DS cell 
3-10 times for both excitation and inhibition and the mean light responses were 
determined for all eight directions. To correct for non-uniform viral expressions in 
the eight sectors, we performed two different types of normalizations. In the first 
procedure, we normalized the current evoked in each sector by the number of 
DsRed2-expressing cells in the sector within 200 jum of recorded ON DS cell bodies 
in the GCL. Here we used an arbitrary fluorescence threshold that was constant 
between experiments. We choose the particular distance of 200 jim because the 
radius of the dendrites of ON DS cells plus the radius of the processes of starburst 
amacrine cells at P6 and P9 were together less than 200 um (data not shown). 

In the second procedure, we normalized each sector to the average fluorescence 
intensity of the starburst cells in the sector and not just the number. This was 
reasonable because the fluorescence intensity of the starburst cells correlated well 
(R = 0.83) with the magnitude of the voltage response of the starburst cell at the 
stimulation intensity used for the mapping procedure (Supplementary Fig. 2): 
therefore, the average fluorescence is a measure of the stimulation strength of 
the starburst cells in the sector. 

To yield the eight quantities plotted on polar plots, these normalized (or not 
normalized) values were further normalized to the largest of the eight numbers 
(for excitation and inhibition, independently). Note that this normalization is 
useful for eliminating variations in synaptic currents arising from the patch-clamp 
technique including series resistance and leak conductance. Note that the largest 
value (of the eight) to which normalization is performed is a mean of a distribution 
since each segment was stimulated 3-10 times (see above). The direction of the 
tuning was determined by multiplying the eight values above with the correspond- 
ing unit vectors pointing in eight directions and then forming the vector sum. The 
direction of the vector sum was interpreted as the direction of tuning. 

The spatial asymmetry index (SAI) was calculated as: 

SAT = (Iventrat ~ I dorsat)/( ventrat + J dorsal) 

where Jventral aNd Igorsai are the amplitudes of the normalized or not normalized 
currents evoked by the stimulation of ventral or dorsal sectors (both normalized 
and not normalized SAIs are shown in Fig. 3 and Supplementary Figs 7-9). 

Monosynaptic restriction of circuit tracing. To create rabies G-expressing 
replication-defective herpes simplex virus-1 (HSV 1), the GFP open reading frame 
(ORF) in the HSV1 vector pR19EFla-GFP-WCm (Biovex) was replaced with that 
of G. First, the G ORF was amplified by PCR from pHCMV-G* using primers 
5'-GTGTCGTGAGGAATTCGTACCGGATCCTCTAGGCCACC-3’ and 5’-CC 
GCTTTACTTGTACATTACAGTCTGGTCTCACCCCCACT-3’. The GFP ORF 
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was removed from pR19EFla-GFP-WCm by EcoRI/BsrGI digestion and the PCR 
fragment of G was recombined into EcoRI-BsrGI site using an in-fusion PCR kit 
(Takara-Clontech). The viral particles were produced by Biovex. G-deleted rabies 
virus encoding mCherry (SADAG-mCherry)* was a gift from E. Callaway. Rabies 
virus expressing mCherry instead of the G glycoprotein was harvested from BHK- 
B19G cells (provided by E. Callaway) and centrifuged as described earlier”. We 
performed stereotaxic surgery to label ganglion cells projecting to the medial 
terminal nucleus (MTN). A cocktail of 10° plaque-forming units of rabies virus 
and 6 X 10* plaque-forming units of HSV1 in 20nl DMEM were loaded into 
pulled-glass pipettes (tip inner diameter of 20-30 1m) and injected into the 
MTN using a microinjector (Narishige, IM-9B). For control experiments, we 
injected 2 X 10° plaque-forming units of rabies virus. Injections were performed 
with 24 mice at P1. Retinas were isolated at P6. Six well infected retinas at P6 were 
fixed by PFA and stained with antibodies. Brains were also isolated and the 
injection sites were localized. All rabies and HSV1 work was carried out under 
Biosafety level 2 conditions. 

The key point of viral tracing was to infect with rabies and herpes viruses an 
upward-motion sensitive ON DS ganglion cell to initiate the retrograde passage of 
rabies viruses from this cell type. Because GFP exclusively labelled upward-motion 
sensitive ON DS cells in the ventral retina of SPIG1-GFP mice it was enough to 
examine the rabies-labelled cells around a GFP-labelled ganglion cell regardless of 
the injection site. The reason we performed the viral tracing in the SPIG1-GFP line 
was to have an internal control for the ganglion cell type for which we examine its 
local circuit. The fact that rabies did label GFP cells (in red) shows that the 
injection reached the MTN (because SPIGI-GFP cells exclusively target 
MTN”"®, see also Supplementary Fig. 6). Even if by mistake we had injected these 
viruses also to other retinorecipient brain regions, this would not compromise our 
tracing results of the GFP-labelled ganglion cells provided, first, that the rabies- 
labelled circuits in the retina were far away so that the ganglion cell to which an 
amacrine cell is presynaptic to could be determined and, second, that only one 
ganglion cell was labelled in that local circuit. The reason we used low rabies titres 
for the herpes/rabies co-injections was to make sure that the circuits analysed were 
far away from each other in the retina (see Fig. 2b). In all circuits we analysed there 
was only one ganglion cell in it (which was GFP-labelled, see Fig. 2d). The defini- 
tion of ‘ganglion cell’ was based on the existence of an axon. 
Immunohistochemistry. After the experiments, retinas were fixed for 30 min in 
4% (w/v) paraformaldehyde in PBS (137mM NaCl, 2.7mM KCl, 4.3mM 
Na,HPOy,, 1.47 mM KH,PO,, pH 7.4) and washed with PBS for at least 1 day at 
4°C. To aid penetration of the antibodies, retinas were frozen and thawed three 
times after cryoprotection with 30% (w/v) sucrose. All other procedures were 
carried out at room temperature. After washing in PBS, retinas were blocked for 
1 hin 10% (w/v) normal donkey serum (NDS; Chemicon), 1% (w/v) bovine serum 
albumin (BSA), and 0.5% (v/v) Triton X-100 in PBS. Primary antibodies were 
incubated for 7 days in 3% (v/v) NDS, 1% (w/v) BSA, 0.02% (w/v) sodium azide 
and 0.5% (v/v) Triton X-100 in PBS. Secondary antibodies were incubated for 2h 
in 3% (v/v) NDS, 1% (w/v) BSA, and 0.5% (v/v) Triton X-100 in PBS together with 


streptavidin-Alexa Fluor 633 (Invitrogen, 1:200) and DAPI (4’,6-diamidine-2- 
phenylindole dihydrochloride, Roche Diagnostics, 10 1gml~') in some experi- 
ments. Streptavidin binds to neurobiotin and thus labels neurobiotin-filled cells; 
DAPI binds to DNA and therefore labels nuclei. After a final wash in PBS, retinas 
were embedded in Prolong Gold antifade (Invitrogen). 

The following set of primary and secondary antibodies combinations were used 
in experiments in which we recorded from SPIGI cells while stimulating ChR2c- 
2A-DsRed2-expressing starburst cells: (1) Primary: goat anti-ChAT (1:200, 
AB144P, Chemicon). Secondary: donkey anti-goat IgG conjugated with Alexa 
Fluor 405 (1:200, Invitrogen). (2) Primary: rabbit anti-red fluorescent protein 
(REP; 1:200, AB3216, Chemicon). This primary antibody binds to DsRed2. 
Secondary: donkey anti-rabbit IgG conjugated with Cy3 (1:200, Jackson). (3) 
Primary: rat anti-GFP (1:500, 04404-84, Nacalai). Secondary: donkey anti-rat IgG 
conjugated with Alexa Fluor 488 (1:200, Invitrogen). The following set of primary 
and secondary antibodies combinations were used for staining rabies virus-infected 
retinas: (1) Primary: goat anti-ChAT (1:200, AB144P, Chemicon). Secondary: 
donkey anti-goat IgG conjugated with Alexa Fluor 633 (1:200, Invitrogen). (2) 
Primary: rabbit anti-red fluorescent protein (RFP; 1:200, AB3216, Chemicon). 
Secondary: donkey anti-rabbit IgG conjugated with Cy3 (1:200, Jackson). (3) 
Primary: rat anti-GFP (1:500, 04404-84, Nacalai). Secondary: donkey anti-rat IgG 
conjugated with Alexa Fluor 488 (1:200, Invitrogen). 

Confocal analysis. Stained retinas were analysed with a Zeiss LSM 700 confocal 
microscope. The DsRed2-expressing cell number was assessed from z-stack 
images by using a X20 lens, numerical aperture (NA) 0.7, X0.5 digital zoom. 
All images were recorded at the same laser power and gain control. Overall 
morphologies of the recorded starburst or ganglion cells were assessed using a 
X40 oil immersion lens, NA 1.2, X0.5 digital zoom or X63 oil immersion lens, NA 
1.3, X0.5 digital zoom. The mCherry-labelled presynaptic circuits of ON DS cells 
were assessed from z-stack images using a X63 oil immersion lens, NA 1.3. 
Statistical analysis. The non-parametric Mann-Whitney U test was used to 
compare data obtained from different cells on different days and the Wilcoxon 
signed rank test for comparing pairs of data where each pair was obtained from the 
same cell (excitation and inhibition). Significance is denoted by * for P< 0.05 and 
** for P< 0.01. The error bars and + values represent standard deviations (s.d.). 
On each figure, the lack of stars between any pairs of data signifies P > 0.05 and, 
therefore, that the two distributions are not statistically different. 
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Identification of two genes causing reinforcement in 
the Texas wildflower Phlox drummondii 


Robin Hopkins! & Mark D. Rausher? 


Species formation generates biological diversity and occurs when 
traits evolve that prevent gene flow between populations. 
Discerning the number and distribution of genes underlying these 
traits and, in a few cases, identifying the genes involved, has greatly 
enhanced our understanding over the past 15 years of species 
formation (reviewed by Noor and Feder’ and Wolf et al.’). 
However, this work has almost exclusively focused on traits that 
restrict gene flow between populations that have evolved as a by- 
product of genetic divergence between geographically isolated 
populations. By contrast, little is known about the characteristics 
of genes associated with reinforcement, the process by which natural 
selection directly favours restricted gene flow during the formation 
of species. Here we identify changes in two genes that appear to cause 
a flower colour change in Phlox drummondii, which previous work 
has shown contributes to reinforcement. Both changes involve cis- 
regulatory mutations to genes in the anthocyanin biosynthetic 
pathway (ABP). Because one change is recessive whereas the other 
is dominant, hybrid offspring produce an intermediate flower 
colour that is visited less by pollinators, and is presumably mal- 
adaptive. Thus genetic change selected to increase prezygotic isola- 
tion also appears to result in increased postzygotic isolation. 

Natural selection can directly favour species formation through a pro- 
cess termed reinforcement. If two incipient species experience secondary 
contact and produce maladaptive hybrids, selection favours decreased 
gene flow and increased reproductive isolation between them*”. 
Reinforcement can be recognized by a resulting pattern known as 
reproductive character displacement: reproductive isolation is greater 
in sympatry than in allopatry*. Although the occurrence of reinforcement 
was historically controversial®’, empirical studies have documented 
reinforcing selection and reproductive character displacement in birds, 
insects, amphibians, plants and mammals, suggesting that it is a com- 
mon step in the process of species formation (reviewed by Ortiz- 
Barrientos et al.’ and Pfennig and Pfennig’). Despite this work, little 
is known about genetic changes associated with reinforcement (but 
see studies of quantitative trait loci by Ortiz-Barrientos et al.’° and 
Saether et al."'). 

Divergence of floral colour in P. drummondii constitutes one of the 
best-documented cases of reinforcement in plants’? and exhibits the 
classic pattern of reproductive character displacement. P. drummondii 
and the closely related P. cuspidata produce similar light-blue flowers 
throughout the allopatric parts of their ranges. However, in the area of 
sympatry, P. drummondii has dark-red flowers, representing the only 
natural evolution of red flowers in the Phlox clade’*. Both species of 
Phlox and colours of P. drummondii are pollinated by the same array 
of species of Lepidoptera'*. Hybrids between these two species are 
formed at rates as high as 11% in the area of sympatry’*. The hybrids 
are vigorous but have high, although not complete, male and female 
sterility’? >. Experimental crosses indicate as many as 40% of hand- 
pollinated hybrid flowers will mature at least one seed and as many as 
72% of crosses sired by hybrid pollen will set one seed’’. Additionally, 
allozyme data show low levels of gene flow between these species of 
Phlox'*. Although other traits may contribute to prezygotic isolation 


(including possible reinforcement traits such as self-compatibility’’), 
Levin demonstrated that the shift from light-blue to dark-red flowers 
in P. drummondii decreases interspecific hybridization by 66%, indi- 
cating that the change in flower colour substantially increases prezygotic 
reproductive isolation’*. Given the above estimates, the hybridization 
rate before the evolution of dark-red flower colour could have been as 
high as 28%, which, with low hybrid fitness, would presumably create 
strong selection to decrease hybridization. 

We determined that the evolutionary transition from light-blue to 
dark-red flower colour in P. drummondii results from changes of large 
effect at two loci. F, populations derived from crosses between the 
allopatric colour variant (light blue) and the sympatric colour variant 
(dark red) segregate four discrete flower colours: dark blue, light blue, 
dark red and light red (Fig. 1a). Quantification of the spectral reflectance 
of 200 F, flowers, transformed into two-dimensional Commission 
internationale de léclairage (CIE) 1976 colour space’’, followed by 
discriminant analysis verified our discrete classifications (Supplemen- 
tary Fig. 1 and Supplementary Table 4a, b). The ratios of counts within 
these categories are very close to the 9:3:3:1 ratios expected from two 
loci with complete dominance ( 03, N=618) = 0.92, P = 0.8206) (Sup- 
plementary Table 3). One locus, H, determines flower hue, with blue 
allele dominant to red, whereas the second locus, I, determines colour 
intensity, with the dark allele dominant to the light. 

These two loci appear to determine the types and amounts of antho- 
cyanin floral pigment produced in P. drummondii. Anthocyanin pig- 
ments, the final products of the well-characterized and highly conserved 
ABP, are derived from six common types of anthocyanidin by the 
addition of sugar and/or methyl moieties'’. Less-hydroxylated antho- 
cyanidins give rise to redder pigments whereas more-hydroxylated 
anthocyanidins give rise to bluer pigments'’. Correspondingly, blue- 
flowered P. drummondii (H-) produce anthocyanins derived from both 
the less-hydroxylated cyanidin and peonidin pigments, as well as the 
more-hydroxylated malvidin pigment, whereas red flowers (hh) pro- 
duce exclusively the less-hydroxylated pigments (Fig. 1b). The change 
in floral hue thus results from redirecting flux from the malvidin branch 
of the anthocyanin pathway to the cyanidin/peonidin branch (Fig. Ic). 
Individuals with increased colour intensity (J-) produce more pigment 
than ii individuals, without an effect on pigment composition (Fig. 1b 
and Supplementary Table 5a, b). These biochemical patterns, coupled 
with the structure of the ABP (Fig. 1c), suggest candidate genes for the 
hue and intensity loci. 

The structure of the ABP (Fig. 1c) suggests that the loss of flux down 
the malvidin branch of the ABP might result from changes in one of three 
candidate genes: (1) loss of function or reduced expression of the gene 
coding for the branching enzyme flavanoid 3’5'-hydroxylase (F3’5’H); 
(2) alteration of the substrate specificity owing to a coding mutation in 
the gene for dihydroflavonol 4-reductase (DFR) making the enzyme 
unable to metabolize the malvidin precursor dihydromyricitin; and (3) 
a similar alteration of the substrate specificity of anthocyanidin 
synthase (ANS). To examine these possibilities, we analysed multiple 
F, populations for co-segregation of floral hue with single nucleotide 
polymorphism (SNP) markers in the candidate genes. In a total of 100 
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Figure 1 | Flower colour phenotypes in F, individuals. a, Representative 
pictures of the four flower colours in F, populations: dark blue, light blue, dark 
red, light red (from left to right). Total counts in F, populations indicated under 
each flower. Ratios of counts are similar to 9:3:3:1 (Caren N=618) = 0.92, 

P= 0.8206). b, Relative anthocyanidin pigment production results in variation 
in F, flower colour. Production of all three pigments is significantly different 
between hue classes. The light- and dark-red flowers produce no malvidin 
(Fa.25) = 134.03, P< 0.0001), and significantly more peonidin (F(,,25) = 58.88, 
P<0.0001) and cyanidin (Fi; 25) = 21.57, P = 0.0002). The amount of pigment 
production is significantly different between classes of flower intensity, with 
high-intensity flowers producing more of each flower colour pigment (Fmavidin 
(1,25) = 8.64, P = 0.0092; Freonidin (1,25) = 32.11, P< 0.0001; Foyanidin 

(1,25) = 66.74, P< 0.0001). Standard errors are shown. See Supplementary 
Table 5a-c for full data and multivariate analysis of variance (MANOVA) 
results. c, A simplified schematic of the ABP, showing core enzymes to the right 
of the arrows, with the substrates and products indicated at the ends of the 
arrows. The two branches of the pathway active in P. drummundii produce two 
3'-hydroxylated red pigments (cyanidin and peonidin) and one 5’- 
hydroxylated blue pigment (malvidin). 


individuals there was perfect co-segregation between F3’5'h and floral 
hue (Supplementary Table 7). Moreover, genotype at F3’5’h explains 
77% of the variation in flower hue. 

This genetic association corresponds to a downregulation of F3’5'h 
in red flowers. Among F, individuals, there is a nearly 100-fold 
decrease in F3'5'h transcripts in red-flowered compared with blue- 
flowered individuals (Fig. 2a). In addition, red- and blue-flowered 
individuals collected from multiple populations throughout the range 
of P. drummondii have a comparable difference in F3’5'h expression 
(Fig. 2b) These results demonstrate three features of the genetic change 
associated with the shift to red flowers: (1) variation in hue is associated 
with transcript level of F3'5’h; (2) genotype at F3’5'h predicts tran- 
script level; and (3) the expression difference evident between naturally 
occurring flower colour variants segregates as a single locus in F, 
individuals. 

These patterns imply that expression variation at F3’5'h is caused by 
variation in a cis-regulatory element. Allele-specific expression assays 
confirm this inference: in heterozygous (Hh) individuals, the red allele 
is almost completely downregulated (Fig. 2c). This allelic imbalance 
indicates a cis-regulatory change, whereas equal expression of the two 
alleles would indicate trans-acting regulation of expression”. 
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Figure 2 | Results of expression experiments on the hue locus (F3’5’h). 
Quantitative PCR results showing relative expression of F3'5'h in blue and red 
individuals in two F, populations (a) and six field populations (b). Transcripts 
of F3'5‘h in red floral tissue (shown in red) are significantly lower than levels 
detected in blue tissue (shown in blue) in both F, populations (F(,,26) = 250.89, 
P<0.0001) and field-collected individuals (F136) = 30.21, P = 0.0053). Delta- 
delta cycle-threshold (ddct) indicates relative transcript levels, and bars indicate 
two standard error units. c, Allele-specific expression of F3’5'h in multiple 
heterozygous individuals (Hh) from two crosses. Each bar represents the 
relative contribution of the red allele (shown in red) and the blue allele (shown 
in blue) to the total expression detected in heterozygous individuals from F, 
families C and D. Relative allelic representation in complementary DNA 
(cDNA) from heterozygous individuals is significantly different from the 
relative allelic representation in genomic DNA (gDNA) (Fi,,32) = 375.93, 
P<0.0001), indicating a cis-regulatory change controlling expression of the 
red allele. The genomic samples from heterozygous individuals show nearly 
equal allelic representation (0.5). See Supplementary Fig. 3 for experimental 
control data. 


Based on the structure of the ABP, we identified three candidate 
genetic changes that could explain the increased pigment production 
in dark (J-) flowers: (1) a cis-regulatory change that increases produc- 
tion of an ABP enzyme with control over pathway flux; (2) increased 
enzymatic efficiency (through coding-sequence mutations) of a rate- 
controlling enzyme in the ABP; and (3) increased expression or func- 
tion of an ABP transcription factor coordinately regulating the 
expression of several enzymes. To evaluate these possibilities, we 
cloned and sequenced genes coding for the core enzymes of the path- 
way (Fig. 1c), as well as an R2R3-Myb transcription factor orthologous 
to those known to regulate ABP enzymes in other angiosperms. Of 
these genes, the marker in R2R3-Myb co-segregates perfectly with 
flower colour intensity in 100 F, individuals (Supplementary Table 
8) and genotype explains 71% of the intensity variation. These obser- 
vations suggest Myb corresponds to the intensity locus. 

This correspondence is further supported by quantification of Myb 
transcript in F, individuals. Myb expression is significantly higher in 
dark individuals than light individuals (Fig. 3a), demonstrating that 
both genotype and expression of this transcription factor are associated 
with variation in pigment production. Furthermore, field-collected 
individuals show the same association between expression of Myb 
and flower colour intensity (Fig. 3b). Finally, analysis of allele-specific 
expression in heterozygous (Ji) individuals show that the ‘dark’ allele is 
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Figure 3 | Results of expression experiments of the intensity locus (R2R3- 
Myb). Relative expression of R2R3-Myb in dark and light F, individuals (a) and 
natural field populations (b). There is a significant upregulation associated with 
dark individuals relative to light both in F, individuals (Fq,12) = 8.11, 

P= 0.0173) and in field-collected individuals (F(,,59 = 14.91, P = 0.0023). 

c, Allele-specific expression indicates significantly different allelic 
representation in cDNA relative to gDNA in heterozygous individuals (Ji) from 
three F, families (F(,,12) = 116.74, P< 0.0001). The over-representation of the 
dark allele (dark grey) relative to the light allele (light grey) indicates a cis- 
regulatory change. See Supplementary Fig. 4 for complete control data. d, There 
is significant upregulation of transcript levels of all core ABP enzymes in field- 
collected dark individuals (dark grey) relative to light individuals (light grey). 
Fins (1,36) — 5.19, P= 0.0305; Feyi (1,36) — 5.07, P= 0.032; Fey (1,36) — 15.81, 
P= 0.0004; Fas, (1,36) = 13.45, P = 0.001; Fans 1.36) = 36.7, P< 0.0001. For full 
MANOVA results see Supplementary Table 10. Standard errors are indicated 
on graphs. 


upregulated relative to the ‘light’ allele (Fig. 3c), a pattern indicative ofa 
cis-regulatory change at the Myb locus. 

In all species that have been examined, this Myb coordinately activates 
several, if not all, of the ABP enzyme-coding genes’’. Thus we expect that 
if changes in expression of Myb influence pigment intensity, there should 
be correlated expression changes in the ABP enzyme-coding genes. This 
expectation was realized: all five core-enzyme genes exhibited significant 
upregulation in dark flowers relative to light flowers (Fig. 3d). These 
results indicate that a cis-regulatory mutation at the R2R3-MYB tran- 
scription factor causes the increased colour intensity of P. drummondii 
flowers in sympatric populations. 

Our investigation has provided the first identification of genetic 
changes causing reinforcement of species boundaries. Ideally, we 
would like to have confirmed the identity of the hue and intensity loci 
by either fine mapping or transformation, but this is not currently 
feasible in non-model systems like P. drummondii. Nevertheless, we 
are confident that we have identified the correct genes. Not only do 
F3'5'h and Myb co-segregate perfectly with the hue and intensity loci, 
respectively, but both exhibit changes in expression levels in the direc- 
tions that are expected to produce the observed phenotypic changes. 
Moreover, the alternative possibility that the changes are due to linked 
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transcription factors is ruled out by differences in allele-specific 
expression. 

We have shown that reinforcement may involve changes in a few 
genes, each change having a large phenotypic effect. Our results 
expand upon two previous analyses of the genetic architecture of re- 
inforcement, which report the involvement of a small number of 
quantitative trait loci’®”*. This simple genetic architecture is consistent 
with theoretical expectations, which indicate that selection for re- 
inforcement is most likely to result in increased reproductive isolation 
when the phenotypic effect of the assortative mating allele is large and 
selection for reproductive isolation is strong””°. 

Ortiz-Barrientos et al.'° suggest that reinforcing reproductive isola- 
tion should be inherited as a dominant trait because of Haldane’s sieve 
(the greater probability of a new dominant adaptive mutation reaching 
fixation than a recessive mutation’'). The intensity locus fits this 
expectation, with the derived dark allele dominant to the light allele. 
In contrast, the hue locus shows the reverse pattern, with the derived 
red allele recessive to the ancestral blue allele. Although it is easier for 
selection to increase the frequency ofa novel dominant allele, Haldane’s 
sieve can be overcome with strong selection’. Additionally, probability 
of fixation has been found to be independent of dominance when 
adaptations are not new mutations but are standing genetic variation”. 
Either of these possibilities could explain the fixation of the red allele in 
sympatry. 

Much work investigating the process of speciation focuses on categor- 
izing traits as a particular type of reproductive isolation mechanism 
(that is, prezygotic or postzygotic) (reviewed by Lowry et al.” and 
Nosil et al.**). Recently it has become clear that a single trait can affect 
multiple types of reproductive isolation’*”®, but it remains unclear how 
commonly this occurs. Although the most obvious effect of floral-colour 
evolution in P. drummondii is increased premating isolation with 
P. cuspidata, the associated genetic changes may also have led to 
increased postzygotic isolation. Hybrids between the dark-red-flowered 
P. drummondii and the light-blue-flowered P. cuspidata have dark-blue 
flowers, which differ from the two parental species. Levin has shown that 
pollinators discriminate against this intermediate hybrid flower colour”’. 
Although further experimental tests are required, this discrimination 
probably reduces male outcross success and possibly seed set, causing 
reduced fitness of the hybrids beyond that associated with intrinsic 
postzygotic isolation. Our work shows how the pattern of genetic domi- 
nance can influence whether a trait can contribute to multiple types of 
isolation. This hypothesized extrinsic postzygotic isolation arises only 
because the novel allele at the intensity locus is dominant, whereas the 
novel allele at the hue locus is recessive. Had the novel alleles at both 
loci exhibited the same dominance, hybrids would have had the same 
flower colour as one of the parents. 

Finally, both of the genetic changes contributing to reinforcement in 
P. drummondii involve cis-regulatory mutations, rather than func- 
tional (coding-sequence) mutations in pathway enzymes. The types 
of mutation described here are similar to those described for other 
cases of flower colour evolution in which reinforcement is not known 
to be involved’***”. These similarities suggest that the genetic basis of 
reinforcement may be comparable to that of adaptations not asso- 
ciated with reinforcement. However, only by indentifying the genetic 
basis of reinforcement in other systems can we understand what 
aspects of the molecular genetic basis of these traits are generally 
important for reinforcement and which are unique to a particular 
system. 


METHODS SUMMARY 


F, populations were created from crossing parents collected from allopatry with those 
collected from sympatry (Supplementary Table 1). F, individuals were phenotyped 
for flower colour and sampled for DNA and RNA extractions. Anthocyanidins were 
extracted from floral tissue using standard protocol and identified and quantified 
using high-performance liquid chromatography”. Candidate genes were amplified 
from parental individuals to identify SNPs, and F, individuals were genotyped. The 
Roche Universal ProbeLibrary system was used to quantify expression of all candidate 
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genes from F, and field-collected populations. Pyrosequencing technology was used 
to quantify allele-specific expression in individuals heterozygous at flower colour 
loci’®. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


F, crosses. Seeds were collected from plants in six populations (Supplementary 
Table 1) in the spring of 2006, and germinated and grown at the Duke University 
greenhouses. Seeds were soaked for 48h in 500 p.p.m. giberellic acid, planted in 
Metro-Mix 360 (Sun Gro Horticulture), and stratified for 6 days at 4°C. Plants 
were transplanted at the four-true-leaf stage into a 50:50 mix of Fafard 4P (Conrad 
Fafard) and PermaTill One Time (Carolina Stalite Company). Light-blue plants 
were crossed with dark-red plants to form F, seeds. F, seeds were grown as above 
and self-fertilized to create F, populations. Self-incompatibility was overcome 
using bud pollination. 

Flower colour phenotyping. All F, individuals were categorically phenotyped for 
flower colour, and a 7’ test was used to determine if the ratio of categorical flower 
colour counts differed from 9:3:3:1 (Supplementary Table 3). A subset of indivi- 
duals were quantitatively phenotyped with a StellarNet EPP2000C spectrometer 
with a SL1 visible light source and a SL5 deuterium +halogen ultraviolet—visible 
light source (StellarNet). Raw reflectance was transformed into two-dimensional 
CIE 1976 LUV colour space’® (Supplementary Fig. 1). This transformation creates 
two axes of variation: the x axis corresponds to w’ and the y axis corresponds to v’. 
The white point is located at (v’ = 0.4683305, u’ = 0.197833). The v’ coordinate 
represents a measure of hue, and distance from the white point is a quantitative 
measure of intensity. We used canonical discriminant analysis to confirm that 
flower colour grouped into four discrete categories (Supplementary Table 4a, b). 
Anthocyanindins were extracted from a total of 26 individuals (12 from cross A, 14 
from cross B) using standard methods of dissolving petal tissue in HCl followed by 
isoamyl alcohol extractions*®. Pigments were identified using high-performance 
liquid chromatography as described in Des Marais and Rausher*’. Pigments were 
quantified by calculating the area under the high-performance liquid chromato- 
graphy peak and scaled to the corresponding standards. We used a MANOVA to 
determine if pigment amount differed by intensity, hue, family and all interactions 
of main effects. We used subsequent individual ANOVA models to determine 
significance of each of the above effects on amount of individual pigments (Sup- 
plementary Table 5a, b). Allanalyses used SAS software version 9.1 (SAS Institute). 
Genetic association. Leaf tissue was collected from F, and parental individuals. A 
modified cetyltrimethylammonium bromide (CTAB) extraction was used to isolate 
the DNA (as in Kelly and Willis*' but with an additional 2% Triton X-100 added to 
the CTAB solution and a3 M sodium acetate wash after ethanol precipitation). Genes 
in the ABP were amplified, first using degenerate primers designed from orthologous 
genes in closely related species and then with species-specific primers in subsequent 
amplifications (Supplementary Table 6). Parental sequences for each gene were used 
to identify SNPs segregating in the F, populations. A subset of F, individuals were 
genotyped at each ABP gene. For those genes that showed an association, subsequent 
F, individuals were genotyped. 7° tests were used to confirm associations between 
genotype and phenotype (Supplementary Tables 7 and 8). A mixed-model ANOVA 
with F, family as a random effect was used to determine how much quantitative 
flower colour variation was explained by F3’5'h and the R2R3-Myb genotype. For this 
analysis we used flower colour reflectance transformed into CIE colour space to 
determine quantitative measures of hue and intensity (see above). The y axis, cor- 
responding to the value of v’, is the measure of hue and the distance each flower 
colour point is from the white point (v’ = 0.4683305, u’ = 0.197833) is the measure 
of intensity. All analyses used SAS software version 9.1 (SAS Institute). 
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RNA expression analyses. All expression analyses were performed on flower-bud 
tissue collected approximately 2 days before opening. RNA was extracted from 
individuals using the SpectrumTM Plant Total RNA Kit (Sigma-Aldrich). The 
Roche Applied Science Universal ProbeLibrary (Roche Diagnostics) was used to 
quantify expression of each gene in the ABP**. Probes sites and primers were 
designed using the online design centre (http://www.roche-applied-science.com/ 
sis/rtpcr/upl/ezhome.html) (Supplementary Table 9). A Thermo Scientific Verso 
1-Step RT-qPCR kit was used to amplify according to the manufacturer’s instruc- 
tions (Fisher Scientific). Twenty-six F, individuals were used in the F3’5'h 
expression assay. Twelve F2 individuals were used in the Myb expression assay. 
Two replicate reactions were performed for each sample and the average cycle- 
threshold value was used in all analyses. Efl-« was amplified in each sample to 
control for total amount of RNA in each extraction. Raw expression data were 
analysed as in Rieu and Powers*’. We used a mixed-model ANOVA to detect a 
significant difference in expression in candidate genes between colour groups. F, 
family was used as a random effect in the model. 

We collected seeds from natural populations of P. drummondii in both allopatry 

and sympatry (Supplementary Table 2), grew them in the Duke University green- 
house and extracted RNA from bud tissue as described above. Thirty-six indivi- 
duals were used from field-collected population expression assays, with an 
additional 14 individuals from two additional populations for the Myb assay. 
Transcript levels of all genes in the ABP were quantified as above. We used a 
MANOVA to determine if flower colour intensity has an effect on expression of 
non-causal ABP genes (Chs, Chi, F3h, Dfr, Ans). Subsequent ANOVAs were used 
to determine if each individual gene shows a significant effect of intensity on 
expression (Supplementary Table 10). All analyses were performed using SAS 
software version 9.1 (SAS Institute). 
Allele-specific expression. RNAs from F, individuals and heterozygous F, indi- 
viduals were used to quantify allele-specific expression at both the F3’5’h and the 
Myb genes. A schematic of the assay design (Supplementary Fig. 2) shows the SNP 
identity, as well as amplification and sequencing primer sequences. RNA was 
extracted as described above from 28 individuals for the Myb assays (11 hetero- 
zygotes and seven homozygous controls) and from eight individuals for the F3'5’h 
assay (six heterozygotes and two homozygous controls). A reverse transcription 
reaction was performed using InvitrogenTM SuperScript II (Life Technologies) 
according to the manufacturer’s instructions. DNA was extracted from leaf tissue 
as described above. For each individual, two replicate DNA and four replicate 
cDNA PCRs were run. No-template controls and no-sequencing-primer controls 
were performed as well. Pyrosequencing reactions were run on all samples using 
PyroMARKTM Q96ID (Qiagen)'***. Experiments on both genes were indepen- 
dently replicated. 
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Homeostasis of self-renewing small intestinal crypts results from 
neutral competition between Lgr5 stem cells, which are small cyc- 
ling cells located at crypt bottoms’. Lgr5 stem cells are inter- 
spersed between terminally differentiated Paneth cells that are 
known to produce bactericidal products such as lysozyme and 
cryptdins/defensins’. Single Lgr5-expressing stem cells can be cul- 
tured to form long-lived, self-organizing crypt-villus organoids in 
the absence of non-epithelial niche cells*. Here we find a close 
physical association of Lgr5 stem cells with Paneth cells in mice, 
both in vivo and in vitro. CD24* Paneth cells express EGF, TGF-a, 
Wnt3 and the Notch ligand DII4, all essential signals for stem-cell 
maintenance in culture. Co-culturing of sorted stem cells with 
Paneth cells markedly improves organoid formation. This 
Paneth cell requirement can be substituted by a pulse of exogenous 
Wnt. Genetic removal of Paneth cells in vivo results in the concom- 
itant loss of Lgr5 stem cells. In colon crypts, CD24* cells residing 
between Lgr5 stem cells may represent the Paneth cell equivalents. 
We conclude that Lgr5 stem cells compete for essential niche signals 
provided by a specialized daughter cell, the Paneth cell. 

In a Matrigel-based culture system containing EGF, the Wnt agonist 
R-spondin 1 and the BMP inhibitor noggin’, single Lgr5 stem cells 
autonomously grow into crypt-like structures with de novo generated 
stem cells and Paneth cells at their bottom. The remainder of these crypts 
consists of transit-amplifying cells, which feed into villus-like luminal 
domains containing post-mitotic enterocytes and goblet cells. Thus, a 
single Lgr5 intestinal stem cell can generate a continuously expanding, 
self-organizing organoid reminiscent of normal gut in the absence of a 
subepithelial cellular niche. Confocal cross-sectioning of crypt bottoms 
of Lgr5-EGFP-ires-creERT2 mice revealed an almost geometrical distri- 
bution of Paneth cells and Lgr5 stem cells that maximized heterotypic 
contact area (Paneth-stem cell) and minimized homotypic contact area 
(Fig. la-c). The same intimate contact was observed in the organoid 
cultures at crypt bottoms (Fig. 1d and Supplementary Movie 1). 

The hypothesis that Paneth cells supply essential niche signals was 
rejected previously’. To retest this, stem cells that were sorted from 
Lgr5-EGEP-ires-creERT2 mice based on GFP expression® were recom- 
bined with wild-type Paneth cells sorted for CD24 expression (Fig. 2a, 
c). Of note, CD24-expressing cells reside between Lgr5 stem cells in 
colon crypts (Fig. 2b), indicating that these are related to Paneth cells. 
Indeed, a secretory cell type, distinct from goblet cells, resides at colon 
crypt bottoms’. Stem cells and/or Paneth cells were seeded in round- 
bottomed plates in 10% Matrigel. Reassociated Lgr5 stem cells typically 
formed short-lived, cystic clusters (Fig. 2d). Reassociated Paneth cells 
tended to form larger aggregates (Fig. 2e) that disintegrated after 
5 days. In three independent experiments, long-lived GFP organoids 
were formed in only 6.7 + 3.3% of 10 wells per experiment containing 
500 Lgr5 stem cells each, and in 0% of 10 wells per experiment contain- 
ing 500 Paneth cells each (we occasionally observed GFP-negative 
organoids originating from contaminating wild-type Paneth cells). 
When 500 stem cells and 500 Paneth cells were combined, GFP* 


organoids formed in 76.7 + 8.8% of 10 wells per experiment (Fig. 2f, 
g). The dynamic reassociation process was illustrated using Lgr5* cells 
sorted from a clonal RFP* organoid culture and Paneth cells from a 
clonal YFP* organoid culture. As shown in Supplementary Movie 2 
(see also Fig. 2h), multiple cell clusters formed initially. The organoids 
fused to form one or two large organoids per well (Supplementary 
Movie 2). Thus, Lgr5 stem cells and Paneth cells appeared to require 
physical contact. Indeed, Lgr5 stem cells are critically dependent on 
Notch signals*"°, which depend on direct cell-cell contact. 

We then performed comparative gene expression profiling on stem 
and Paneth cells. The heat-map in Fig. 3a confirmed the segregation of 
Paneth cell markers (lysozyme, defensin Al (ref. 3)) and stem-cell 
markers (Lgr5, Olfm4, Tnfrsf19, Cdca7 (ref. 11)). Among the genes 
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Figure 1 | Geometric distribution pattern of Paneth cells and Lgr5 stem 
cells. a, b, Confocal cross-section of Lgr5-EGFP-ires-creERT2 intestine. 
E-cadherin (a; white) demarcates cell borders. Lgr5 stem cells (green) and 
Paneth cells (a, black; b, lysozyme, red) are shown. c, Contact area of either 
Paneth cells or Lgr5 stem cells was quantified with Image J. The values are 
depicted as mean + s.d. from three independent mice. Red columns and green 
columns indicate contact area with Paneth cells and Lgr5 stem cells, 
respectively. d, Stills from Supplementary Movie 1. Time course of crypt 
organoid growth. Differential interference contrast image reveals granule- 
containing Paneth cells (red arrowheads) at the site of budding where a new 
crypt forms. Lgr5-GFP (green) stem cells expand at the crypt base in close 
proximity to Paneth cells. Asterisk and dotted oval indicate autofluorescence. 
Scale bar: 50 um. 
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most highly enriched in Paneth cells, we noted Wnt3, Wnt11, Egf, Tgfa 
and the Notch ligand Dil4 (Fig. 3a). Wnt3 expression’? and EGF 
expression’* had been noted previously. Thus, Paneth cells provided 


Figure 3 | Paneth cells produce Wnt3 and other essential niche signals for 
Lgr5 stem cells. a, Heat-map of two independent microarray expression 
experiments (1 and 2; 3 and 4) performed with dye-swap (1 versus 2 and 3 
versus 4) from sorted Paneth cells versus Lgr5 stem cells. b, Wnt3 is expressed 
by Paneth cells at crypt bottoms as analysed by in situ hybridization. 

c-h, Localized Wnt production regulates crypt—villus morphogenesis in 
culture. c, Freshly isolated crypts from an Axin2-lacZ mouse were cultured in 
standard EGF/noggin/R-spondin 1 medium (ENR) for 4 days. LacZ response is 
only seen near the bottoms of the two crypts. d, Intestinal adenoma samples 
from APC”™” mice were cultured in ENR medium in the absence of R-spondin 
for 7 days. e, Axin2-LacZ crypts grown in ENR medium. f, As in e, with the 
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Figure 2 | Paneth cells express CD24 and support growth of Lgr5 stem cells. 
a, Isolated small intestinal crypt. CD24 (red) is expressed by Paneth cells in 
which granules are visualized by differential interference contrast. Lgr5-GFP* 
stem cells are green. Counter stain: DAPI (blue). b, Isolated colonic crypt. 
CD24" cells (red) are in intimate contact with Lgr5 stem cells (green). Top, 
longitudinal crypt section; bottom, section through crypt bottom. c, FACS plot 
of dissociated single cells from small intestinal crypts. Two CD24 bright 
populations differ by side-scatter (SSC) pattern. Sorted CD24"SSC'” and 
CD24"SSC cells are subsequently stained. CD24"'SSC'” cells are positive for 
the enteroendocrine marker chromogranin A (top right), whereas CD24"™/ 
SSC" cells are positive for the Paneth marker lysozyme (bottom right). 

d-g, Single sorted Lgr5 stem cells from Lgr5-EGFP-ires-creERT2 small intestine 
(d), sorted single Paneth cells (e) from wild-type small intestine, and a 
combination of the two cell types (f) were seeded in round-bottomed wells and 
cultured for 2 days. g, Lgr5 stem cells form expanding Lgr5-GEP* (green) 
organoids only when reassociated with Paneth cells. h, Stills from 
Supplementary Movie 2. Time course of the reassociation culture with RFP* 
Lgr5-GFP stem cells (red) and YEP* Paneth cells (yellow). Scale bar, 50 jum. 


essential signals for stem-cell support: EGF, Wnt3 and Notch. High- 
level expression of Wnt3 was confirmed by in situ hybridization 
(Fig. 3b). 

R-spondin 1 potently amplifies Wnt responses, yet is inactive on its 
own". When organoids were grown from crypts derived from Axin2- 
LacZ mice’’, Wnt responses as assayed by LacZ expression were 
restricted to the crypt base, despite the ubiquitous presence of 
R-spondin 1 (Fig. 3c, e and Supplementary Fig. 1). When exogenous 
Wnt3A was added, the organoids diffusely expressed the blue Wnt 
reporter (Fig. 4g). The global response to Wnt caused the typical crypt- 
villus architecture to change into rounded cysts devoid of differen- 
tiated cell types (Fig. 3e, g). Indeed, Wnt signalling instructs intestinal 
cells to adopta proliferative progenitor phenotype’®. The same rounded 
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addition of porcupine inhibitor IWP1 at 1 uM. g, Crypts from an Axin2-lacZ 
mouse cultured in ENR medium plus Wnt3A. h, Same as g, with the addition of 
IWP1. Insets in e-h depict Axin2-LacZ expression (blue). e, g, h, Six days 
culture; f, 3 days culture after which the organoids disintegrate. See also 
Supplementary Fig. 2. i, Plating efficiency of Lgr5 stem cell-Paneth doublets, 
Lgr5 stem cell doublets, single Paneth cells and single Lgr5 stem cells with (grey) 
or without (black) Wnt3A at 100 ng ml’. Assays were read out as budding 
organoids at 14 days after sorting. The values are depicted as mean + standard 
error of the mean (s.e.m.) from three independent experiments. ND, not 
detected. See also Supplementary Fig. 3 and Methods for details of doublet 
isolation and culture. All scale bars, 50 um. 
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cysts were routinely observed upon culturing APC-deficient cells from 
APC™" adenomas!” (Fig. 3d). When the small-molecule Wnt secretion 
inhibitor (porcupine inhibitor) IWP1 (ref. 18) was added, the Axin2- 
LacZ signal in wild-type organoids was entirely lost and proliferation 
halted (compare Fig. 3e and f; a dose-response curve is given in Sup- 
plementary Fig. 2). This inhibition could be overcome by exogenous 
Wnt3A (Fig. 3g, h and Supplementary Fig. 2), confirming the specificity 
of the Wnt secretion inhibitor. We concluded that exogenous 
R-spondin 1 acts by amplifying the local response to short-range 
Wnt produced by Paneth cells. Thus, only the direct neighbours of 
Paneth cells, the Lgr5 stem cells, receive strong Wnt signals, which 
can be further increased by R-spondin 1. Moreover, these observations 
indicated that the asymmetry of crypt—villus organoids was established 
by the localized presence of Wnt-producing Paneth cells. We recently 
observed that stem cell-Paneth cell doublets display a strongly 
increased plating efficiency compared to single stem cells”. This 
Paneth-cell-dependence of single stem cells, illustrated in Supplemen- 
tary Fig. 3, could be overcome by the addition of Wnt3A at 100 ng ml * 
for the first 3 days of culture (Fig. 3i). 

To investigate, using in vivo models, whether Paneth cells provide 
essential support to Lgr5 stem cells, we used three previously described 
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genetic mouse models for Paneth cell loss: mutation of Gfil (ref. 19), 
transgenic expression of diphtheria toxin A under the Paneth-cell-spe- 
cific cryptdin 2 promoter (CR2-tox176 (ref. 5)) and conditional deletion 
of Sox9 (refs 20, 21). When we re-visited crypts of Gfil~’~ adult mice, 
described to lack Paneth cells'’, Paneth cell numbers were reduced but 
not absent, as also seen recently” (Fig. 4a, b and Supplementary Fig. 4). 
Stem cells were coincidently decreased in number (Fig. 4d, e) and co- 
localized with remaining Paneth cells, as visualized by double staining 
for Olfm4 and lysozyme (Fig. 4g, h). Similarly, in the CR2-tox176 mice, 
we noted that Paneth cells were reduced but present (Fig. 4c and 
Supplementary Fig. 4), in agreement with the reported 82% decrease 
in Paneth cell numbers’. Numbers of stem cells were again decreased, 
coincident with Paneth cells (Fig. 4c, f, i-k). Lysozyme staining in the 
sequential intestinal sections from both models revealed that >90% of 
crypts harboured at least one Paneth cell (not shown). We speculated 
that the minority of crypts without observable Paneth cells were short- 
lived, as observed in the conditional Sox9 model (see below). 

We conditionally deleted the Sox9 gene in 6-week-old mice, homo- 
zygous for a Sox9" allele and heterozygous for the Ah-cre allele”. Paneth 
cells are estimated to have a lifetime of 8 weeks”. The Sox9 gene was 
efficiently deleted in all crypt cells with the exception of pre-existing 
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Paneth cells, where the Ah-cre transgene is not activated’’. Although 
Sox9 is expressed in Lgr5 stem cells'', we observed no stem-cell pheno- 
type at early time points after deletion (Supplementary Fig. 5a, b). From 
4weeks onwards, Paneth cell numbers visibly decreased. Loss was 
virtually complete after 7-8 weeks (Fig. 4s), after which a regenerative 
response occurred. We occasionally noted Sox9 /~ crypts witha single 
remaining Sox9-positive Paneth cell (Supplementary Fig. 5c; red 
arrows). Stem cells disappeared coincident with Paneth cells, and the 
remaining stem cells crowded around the remaining Paneth cells 
(Supplementary Fig. 5d). Supplementary Fig. 6 depicts a field of 
‘escaper’ wild-type crypts adjacent to a field of Sox9-negative crypts. 
The escaping wild-type crypts containing abundant Paneth cells 
rapidly replaced the mutant crypts by crypt fission (Supplementary 
Fig. 6). By day 67, all crypt basal cells were Sox9* again (Fig. 4n) and 
contained normal numbers of Paneth cells (Fig. 4t) and stem cells 
(Fig. 4q). From this, we concluded that Paneth cells are essential for 
the maintenance of crypts and stem cells. 

Stem cell niches are typically portrayed as pre-existing sites to which 
stem cells migrate”. Here we show that intestinal stem cells receive 
niche support from their own specialized progeny. This is not without 
precedent, as the somatic stem cells of the fly testis give rise to differ- 
entiated cells that in turn build the testis niche*®. Thus, Paneth cells 
serve as multifunctional guardians of stem cells, both by secreting bac- 
tericidal products and by providing essential niche signals. Lgr5 stem 
cells divide symmetrically and their numbers are restricted by neutral 
competition at the stem-cell population level”. We now propose that 
Lgr5 stem cells compete for available Paneth cell surface. Paneth cell 
numbers must therefore be tightly regulated, which is indeed the case. 
Paneth cells are generated directly above the crypt base, the latter ori- 
ginally termed the ‘stem cell zone?’”*. It will be of interest to understand 
what determines Paneth cell numbers and their slow turnover rate. 


METHODS SUMMARY 


Reagents. Murine recombinant EGF and noggin were from Peprotech; Wnt3A 
was from Millipore. Human recombinant R-spondin 1 was provided by A. Abo”. 
Y-27632 was from Sigma; IWP1 was provided by L. Lum’. 

Mice. Lgr5-EGFP-ires-creERT2 mice', APC’ ™" (ref. 17), Axin2-lacZ mice", Gfil zis 
(ref, 19), CR2-tox17 (ref. 5), Sox" (ref. 20) and R26R-confetti” mice have been 
described earlier. The transgenic Ah-cre line”? was crossed with Sox9" mice. Cre 
enzyme was induced by intraperitoneal injections of 200 il B-napthoflavone (10 mg 
ml}; Sigma) dissolved in corn oil for three consecutive days. 

Crypt isolation, cell dissociation and culture. Crypt isolation, cell dissociation 
and culture have been described previously”; see Methods for details. 
Reassociation assay. A total of 500 sorted Lgr5-GFP™ stem cells (purity >99%) 
were co-cultured with 500 genetically unmarked CD24* Paneth cells (purity 
>95%). Cells were re-suspended in 100 ul of culture medium in Ultra-low attach- 
ment 96-well round-bottomed plates (Corning) and the plates were left on ice for 
15 min. The plate was centrifuged (300g) for 5 min and 10 ul of Matrigel was added 
to each well. For Supplementary Movie 2, Lgr5-GFP™/confetti-RFP* and Lgr5- 
GFP/confetti-YFP* stem cells were sorted separately from Lgr5-EGFP- 
ires-creERT2 X R26R-confetti mice’, tamoxifen-induced 3 days before they were 
killed. After 10 days of culture, 1,500 Lgr5-GFP"'/CD24°™/confetti-RFP* stem 
cells and 1,500 CD24"'/confetti-YFP* Paneth cells were sorted from these two 
respective organoid cultures, and filmed for ten consecutive days, interrupted twice 
for exchange of medium (see Methods for details). 

Tissue preparation for confocal analysis. For semi-thick sectioning of near- 
native tissue, organs were fixed in 4% paraformaldehyde at room temperature 
for 20 min and washed in cold PBS. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Reagents. Murine recombinant EGF and noggin were from Peprotech. Murine 
recombinant Wnt3A was from Millipore. Human recombinant R-spondin 1 was 
provided by A. Abo”. Y-27632 was from Sigma. IWP1 was provided by L. Lum’’. 
Mice. Lgr5-EGFP-ires-creERT2 mice', APC” (ref. 17), Axin2-lacZ mice"’, Gfil a 
(ref. 19), CR2-tox17 (ref. 5), Sox9™" (ref, 20) and R26R-confetti” mice have been 
described earlier. The transgenic Ah-cre line”® was crossed with Sox9™”" mice. Cre 
enzyme was induced by intraperitoneal injections of 200 1l B-napthoflavone (10 mg 
ml |; Sigma Aldrich) dissolved in corn oil for three consecutive days. 

Crypt isolation, cell dissociation and culture. Crypt isolation, cell dissociation 
and culture have been described previously™*. For culture/sorting experiments, at 
least three independent experiments were performed. For each experiment, 
crypts/cells were pooled from three intestines. For microarray, sorted cells from 
ten intestines were pooled. Crypts were directly cultured as previously described 
(100 crypts per well on 24-well plates)*. For single-cell or doublet-cell culture, 
crypts were dissociated with TrypLE express (Invitrogen) including 2,000 U ml * 
DNase (Sigma) for 30 min at 37 °C or 2h at room temperature. For reassociation 
assay from established crypt organoids, the samples were dissociated with TrypLE 
express for 15 min at 37°C. Dissociated cells were passed through 20-11m cell 
strainer (Celltrix) and washed with PBS. Cells were stained with PE-conjugated 
anti-CD24 antibody (eBioscience) and APC-conjugated anti-Epcam antibody 
(eBioscience) for 15min at 4°C, and analysed by MoFlo (DakoCytomation). 
Viable epithelial single cells or doublets were gated by forward scatter, side scatter 
and pulse-width parameter, and negative staining for propidium iodide or 7-ADD 
(eBioscience). Sorted cells were collected, pelleted and embedded in Matrigel (BD 
bioscience), followed by seeding on a 96-well plate (30-50 singlets or doublets; 
10 pl Matrigel per well). Culture medium (Advanced DMEM/F12 supplemented 
with penicillin/streptomycin, 10 mM HEPES, Glutamax, 1X N2, 1X B27 (all from 
Invitrogen) and 1 iM N-acetylcysteine (Sigma) containing growth factors 50 ng 
ml * EGF, 100ngml~' noggin, 1 ugml-* R-spondin) was overlaid. Y-27632 
(10 1M) was included for the first 2 days to avoid anoikis. Growth factors were 
added every other day and the entire medium was changed every 4 days. In some 
experiments, 100 ngml~’ Wnt3A (Millipore) was added in the culture medium. 
Sorted cells were manually inspected by inverted microscopy, and the numbers of 
viable organoids in triplicate were calculated. 

Reassociation assay. A total of 500 sorted Lgr5-GEP™ stem cells (purity >99%) 
were co-cultured with 500 genetically unmarked CD24* Paneth cells (purity 
>95%). Cells were re-suspended in 100 pl of culture medium in Ultra-low 
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attachment 96-well round-bottomed plates (Corning) and the plate was left on 
ice for 15min. The plate was then centrifuged (300g) for 5min and 10 ul of 
Matrigel was added in each well. For Supplementary Movie 2, Lgr5-GFP"™/ 
confetti-RFP* and Lgr5-GFP"/confetti-YFP* stem cells were sorted separately 
from Lgr5-EGFP-ires-creERT2 X R26R-confetti mice”, tamoxifen-induced three 
days before the mice were killed. After 10 days of culture, 1,500 Lgr5-GFP™/ 
CD24*"/confetti-RFP* stem cells and 1,500 CD24'/confetti-YEP* Paneth cells 
were sorted from these two respective organoid cultures, and filmed for ten con- 
secutive days, interrupted twice for exchange of medium. The fluorescent and 
phase-contrast images were acquired every 20min by inverted microscopy 
(AF7000, Leica) equipped with a live imaging chamber (humidified with sterile 
water and maintained at 37 °C, 7.5% CO3). 

Microarray analysis. Single Lgr5-GFP™ cells or Paneth cells were sorted into 
Buffer RLT in the RNeasy Micro kit (Qiagen). Microarray analysis (Agilent) 
was performed as previously described*. The data were deposited to the GEO 
database (accession number GSE25109). Heat-maps were created using 
Treeview software. 

Histology, immunohistochemistry and in situ hybridization. Samples taken 
from the middle of the small intestine were fixed in 4% paraformaldehyde (PFA), 
embedded in paraffin, and processed as previously described’. The primary anti- 
bodies were: mouse anti-E-cadherin (1:100, BD transduction), mouse anti-Ki67 
(1:250, Monosan), mouse anti-phospho histone H3 (1:1,000, Millipore), rabbit 
anti-Sox9 (1:600, Millipore), rabbit anti-lysozyme (1:1,000, Dako) and anti- 
chromogranin A (1:100, Santa Cruz). The secondary antibodies were peroxidase- 
conjugated antibodies or Alexa-568-conjugated antibodies. For whole-mount 
immunostaining, freshly isolated crypts were fixed with 4% PFA, and stained with 
anti-CD24 antibody (eBioscience) over night at 4°C. After washing, the samples 
were incubated with Alexa-568-conjugated anti-rat antibody over night at 4 °C. 
DNA was stained by DAPI (Molecular Probe). For counting the number of Lgr5 
stem cells (GFP) and Paneth cells (lysozyme), 1 cm’ of PFA-fixed intestinal wall was 
put in a mould. Four per cent low-melting-point agarose (40 °C) was added and 
allowed to cool on ice. Once solid, a vibrating microtome (HM650, Microm) was 
used to make semi-thick sections (150 um) (velocity, 1 mm st; frequency, 65 Hz; 
amplitude, 0.9 mm)’. Sections were permeabilized and stained as previously 
described’. Images were acquired with confocal microscopy (Leica, SP5). For in situ 
hybridization, cRNA probes were generated from full-length cDNA expression 
vectors (IMAGE consortium or RPZD) with in vitro transcription. The protocol 
was described elsewhere’’. X-gal staining was performed as previously described’. 


©2011 Macmillan Publishers Limited. All rights reserved 


LT aR 


doi:10.1038/nature09674 


Reduction of disulphide bonds unmasks potent 
antimicrobial activity of human $-defensin 1 


Bjoern O. Schroeder!?, Zhihong Wu’, Sabine Nuding’?, Sandra Groscurth*}, Moritz Marcinowski’, Julia Beisner’?, 
Johannes Buchner”, Martin Schaller®, Eduard F. Stange’ & Jan Wehkamp!*” 


Human epithelia are permanently challenged by bacteria and 
fungi, including commensal and pathogenic microbiota’. In the 
gut, the fraction of strict anaerobes increases from proximal to 
distal, reaching 99% of bacterial species in the colon’. At colonic 
mucosa, oxygen partial pressure is below 25% of airborne oxygen 
content, moreover microbial metabolism causes reduction to a low 
redox potential of —200 mV to -300 mV in the colon*. Defensins, 
characterized by three intramolecular disulphide-bridges, are key 
effector molecules of innate immunity that protect the host from 
infectious microbes and shape the composition of microbiota at 
mucosal surfaces’ *. Human f-defensin 1 (hBD-1) is one of the 
most prominent peptides of its class but despite ubiquitous 
expression by all human epithelia, comparison with other defensins 
suggested only minor antibiotic killing activity”’®. Whereas much 
is known about the activity of antimicrobial peptides in aerobic 
environments, data about reducing environments are limited. 
Herein we show that after reduction of disulphide-bridges hBD-1 
becomes a potent antimicrobial peptide against the opportunistic 
pathogenic fungus Candida albicans and against anaerobic, Gram- 
positive commensals of Bifidobacterium and Lactobacillus species. 
Reduced hBD-1 differs structurally from oxidized hBD-1 and 
free cysteines in the carboxy terminus seem important for the 
bactericidal effect. In vitro, the thioredoxin (TRX) system” is able 
to reduce hBD-1 and TRX co-localizes with reduced hBD-1 in 
human epithelia. Hence our study indicates that reduced hBD-1 
shields the healthy epithelium against colonisation by commensal 
bacteria and opportunistic fungi. Accordingly, an intimate inter- 
play between redox-regulation and innate immune defence seems 
crucial for an effective barrier protecting human epithelia. 

We modified the radial diffusion assay’? to analyse antimicrobial 
activity of synthetic hBD-1 against anaerobic bacteria of the normal 
flora under anaerobic conditions. Increasing concentrations of the 
reducing agent dithiothreitol (DTT) were added to assay medium. In 
medium without DTT hBD-1 did not affect growth of Gram-positive 
Bifidobacterium adolescentis (Fig. 1a). Surprisingly, addition of increas- 
ing amounts of DTT led to an increase of inhibition zones in size and 
sharpness. In contrast, human -defensin 3 (hBD-3) and lysozyme 
showed less antimicrobial activity upon addition of DTT (Fig. 1b and 
Supplementary Fig. 1). Thus, increased antimicrobial activity of hBD-1 
in reducing environment was specific for hBD-1 and not caused by 
impaired bacterial fitness. Remarkably, at concentrations of 2mM DTT 
hBD-1 became as effective as hBD-3 against B. adolescentis (Fig. 1c). 
This finding is crucial as hBD-3 is one of the most powerful antimi- 
crobial peptides in oxygen-rich environment, whereas hBD-1 was 
thought to be one of the weakest. 

For the Gram-negative anaerobe Bacteroides vulgatus we found no 
antimicrobial effect of hBD-1 and lysozyme under any conditions, 
whereas hBD-3 showed concentration-dependent inhibition zones 
(Supplementary Fig. 2). 


Because -defensins contain three intramolecular disulphide-bridges”” 
we investigated the involvement of cystines for the observed antimicro- 
bial effect. Therefore, we incubated hBD-1 with increasing con- 
centrations of DTT and analysed samples using matrix-assisted laser 
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Figure 1 | hBD-1 shows antimicrobial activity under reducing conditions. 
a, b, Bifidobacterium adolescentis was incubated with up to 2mM DTT under 
anaerobic conditions with hBD-1 (a) or hBD-3 (b). Inhibition zones were 
measured and statistically evaluated using student’s t-test with *P < 0.05, 
**P < 0.01, ***P < 0.001. Representative radial diffusion assays of three 
experiments are shown. Data are presented as means, error bars indicate 
standard deviation. c, Comparison of inhibition zones between hBD-1 and 
hBD-3 in growth medium without or with 2mM DTT. Dotted lines at 2.5 mm 
represent base line diameter of punched wells. 
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desorption/ionisation (MALDI)-mass spectroscopy (MS). A single signal 
could be detected for oxidized hBD-1 (oxhBD-1) at m/z 3,926.8 (Fig. 2a). 
DTT-treatment and carboxamidomethylation resulted in a second signal, 
corresponding to the completely reduced form (redhBD-1) with all six 
cysteine residues alkylated. These results suggest that all three cystines are 
either present or absent, without the existence of intermediate states 
containing one or two disulphide-bridges. 

Reversed-phase high performance liquid chromatography (RP- 
HPLC) showed that oxhBD-1 eluted after 30 min (Fig. 2b). Incubation 
with increasing concentrations of DTT resulted in a shift towards a peak 
at 33.5 min, representing the completely reduced hBD-1 (confirmed by 
MALDI-MS, data not shown) and indicating an increase of hydropho- 
bicity of redhBD-1 in solution. 

To elucidate structural changes further, nuclear magnetic resonance 
(NMR) experiments on recombinant, uniformly 'SN-labelled reduced 
and oxidized hBD-1 were performed. oxhBD-1 showed a well-dispersed 
'5\-heteronuclear single quantum coherence (HSQC) spectrum as 
expected fora folded protein (Fig. 2c). In addition, the detected cross peaks 
in the HNH- and NNH-nuclear Overhauser enhancement spectroscopy 
(NOESY) spectra as well as the restricted dynamics observed in the 
"H-'°N-heteronuclear NOE (hetNOE) experiment indicated a highly 
structured peptide (data not shown). In contrast, the dispersion of the 
HSQC spectrum of redhBD- 1 indicated a lack ofhydrogen bonds for most 
parts of the protein (Fig. 2c). Accordingly, the HNH- and NNH-NOESY 
spectra showed significantly less cross peaks and the dynamics of the 
hetNOE experiment clearly support an unstructured, highly flexible poly- 
peptide chain. Similar findings were obtained by circular dichroism (CD) 
spectroscopy (Fig. 2d). oxhBD-1 displayed a characteristic minimum at 
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Figure 2 | Reduced hBD-1 differs structurally from oxidized hBD-1. 

a, hBD-1 (1 ig) was incubated with different concentrations of DTT, alkylated 
and analysed by MALDI-MS. b, hBD-1 (1 jig) was incubated with different 
concentrations of DTT and analysed by RP-HPLC. oxhBD-1, oxidized hBD-1; 
redhBD-1, reduced hBD-1. c, Nuclear magnetic resonance (NMR) analysis of 
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209 nm for alpha-helices anda significant signal at 218 nm, corresponding 
to the beta-sheet content, indicating a well-folded peptide. In contrast, 
redhBD-1 did not show these minima but displayed a minimum around 
195 nm, indicating a random coil-like structure, being consistent with 
previous data’. Indeed, calculation of secondary structure elements pro- 
posed an increase in the random coil fraction for the reduced peptide 
(Supplementary Table 1). From these structural observations we pro- 
pose that antimicrobial activity of hBD-1 in a reducing environment is 
attributable to its reduced, unstructured form. 

To exclude artificial effects of the reducing agent DIT we compared 
the antimicrobial activity of oxhBD-1 and redhBD-1 against different 
bifidobacteria under anaerobic conditions in medium lacking DTT. 
Synthetic, oxidized hBD-1 did not affect growth of bifidobacteria 
whereas linear hBD-1 showed antimicrobial activity against all examined 
strains (Fig. 3a and Supplementary Fig. 3). Similar results were obtained 
for Gram-positive lactobacilli, whereas we detected no effect of any hBD- 
1 against Gram-negative Bacteroides vulgatus (Fig. 3b). For the facul- 
tative anaerobe Escherichia coli K12 we found antimicrobial activity of 
both oxhBD-1 and redhBD-1, which is in accordance with literature'*'*. 
When testing the commensal, facultative pathogenic fungus Candida 
albicans we found potent antifungal activity of redhBD-1 but not 
oxhBD-1 against four out of five strains (Fig. 3c) when using a flow 
cytometric antimicrobial killing assay'’. Consequently, reduced hBD-1 
is not only antimicrobial against Gram-positive anaerobes of the human 
normal flora but also against a facultative pathogenic fungus of clinical 
importance. 

Subsequently, we generated recombinant Ala/Ser-variants of hBD-1 
in which all cysteine residues were substituted by alanine or serine to 
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hBD-1. Superimposed 'H-'°N-heteronuclear single quantum coherence 
(HSQC) spectra of oxidized (black) and reduced (red) hBD-1. 5,-°N, chemical 
shifts of '°N nuclei; 6,-'H, chemical shifts of protons. d, Circular dichroism 
spectroscopy of oxidized (black) and reduced (red) hBD-1. Ompw; molar 
ellipticity normalized to mean amino acid residue weight. 
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Figure 3 | Reduced but not oxidized hBD-1 has a microbicidal effect. 

a, b, oxhBD-1 and redhBD-1 (1.5 jg) were incubated with bifidobacteria 

(a) and lactobacilli (3 ug defensin), Bacteroides vulgatus and Escherichia coli 
(b). B. adol., B. adolescentis; L. acid., Lactobacillus acidophilus. c, Flow 
cytometric antimicrobial killing assay of C. albicans incubated with reduced 
(red) or oxidized (black) hBD-1. Data are presented as mean = s.e.m. of two 
independent experiments each done in duplicates. d, RP-HPLC of oxidized and 


investigate if a linear structure itself accounts for the augmented activity 
(Fig. 3d). All variants were either inactive or only weakly active against 
bifidobacteria and lactobacilli (Supplementary Fig. 4). The only bac- 
terium being sensitive was E. coli K12, which was also sensitive to 
oxhBD-1 (Fig. 3b). Since redhBD-1 showed increased hydrophobicity, 
we performed RP-HPLC analyses of the Ala/Ser-variants (Fig. 3d), 
revealing that these eluted earlier, indicating a lower surface hydropho- 
bicity compared to redhBD-1. 

A truncated variant of hBD-1 (hBD-1,30-36) lacking the seven C- 
terminal amino acids GKAKCCK did not inhibit growth of B. adoles- 
centis under normal nor reducing conditions whereas the C-terminal 
seven amino acid peptide itself was antimicrobially active (data not 
shown). Comparable activity was observed when reversing the amino 
acid sequence of the C-terminal peptide while substituting cysteines by 
alanines or serines abolished antimicrobial activity completely. 
Accordingly, unfolding itself is not sufficient for antimicrobial activity 
of redhBD-1; rather hydrophobicity and free cysteine residues located 
in the C terminus are important features (Supplementary Fig. 5). 
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reduced hBD-1 and alanine/serine variants. e, Flow cytometric antimicrobial 
killing assay of B. adolescentis incubated with reduced (red) or oxidized (black) 
hBD-1 for 30 (squares) or 90 (triangles) minutes. Results are presented as 
mean + s.e.m. of two independent experiments each done in duplicates. 

f, Transmission electron microscopy of B. adolescentis incubated with oxidized 
hBD-1, without defensin (control), reduced hBD-1 or with hBD-3. Scale bars, 
200 nm. 


By using a flow cytometric antimicrobial killing assay’” we found that 
redhBD-1 had a rapid bactericidal effect: redhBD-1, but not oxhBD-1, 
caused bacterial membrane depolarization of B. adolescentis after 
incubation for only 30 min (Fig. 3e). Correspondingly, when perform- 
ing transmission electron microscopy with B. adolescentis (Fig. 3f), 
oxhBD-1 did not cause any morphological damage of bacterial cells. 
In contrast, incubation with redhBD-1 caused loss of stainable cyto- 
plasmic content through a process which seemed to be initiated intra- 
cellularly. Hence, we assume that linear hBD-1 kills bacteria by using a 
mechanism being different from that of hBD-3, which primarily attacks 
the bacterial membrane (Fig. 3f). 

Physiologically, enzymatic redox-regulation is mainly controlled by 
thioredoxin (TRX), a multifunctional and ubiquitously expressed oxi- 
doreductase'''*. Because TRX is expressed by mucosal surfaces and 
shows extracellular functions’, we tested if hBD-1 is a natural substrate 
for thioredoxin. Therefore we incubated oxhBD-1 with different con- 
centrations of TRX in the presence of nicotinamide adenine dinucleotide 
phosphate (NADPH) and thioredoxin reductase (TrxR), composing the 
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Figure 4 | Thioredoxin (TRX) catalyses reduction of oxidized hBD-1 and 
co-localizes with redhBD-1 in vivo. a, Oxidized hBD-1 was incubated with 
human thioredoxin, rat thioredoxin reductase and NADPH. Incubation 


mixtures were analysed by RP-HPLC. b, Immunohistochemical analysis of 
reduced/alkylated hBD-1 (red/alkhBD-1; left), thioredoxin (TRX, centre) and 


natural thioredoxin system"', and analysed samples by RP-HPLC to 
evaluate oxidized and reduced hBD-1 fractions. hBD-1 was reduced by 
thioredoxin in a concentration-dependent fashion (Fig. 4a). Likewise, 
increasing TrxR concentration increased the amount of redhBD-1 
while omitting TrxR prevented reduction (data not shown). These 
findings were confirmed by a sensitive spectrophotometric protein 
disulphide reduction assay”® measuring the consumption of NADPH 
(Supplementary Fig. 6). Although minor amounts of NADPH were 
consumed when using protein disulphide isomerase (PDI) instead of 
TRX or by using TrxR without TRX (data not shown), we propose that 
the complete thioredoxin system might be a physiological mediator 
catalysing effective reduction of oxhBD-1. 

To document the presence of reduced hBD-1 in vivo, we generated a 
redhBD-1-specific antibody. This antibody recognized reduced hBD-1 
and alkylated hBD-1, but not correctly folded synthetic or recombin- 
ant hBD-1 in an immunodot assay as well as during western blot 
analyses (Supplementary Fig. 7). redhBD-1 was observed in human 
colonic mucosa and at the bottom of small intestinal crypts by 
immunohistochemistry staining (Fig. 4b). Additionally, redhBD-1 
staining was observed in human skin epidermis, whereas the protein 
G-purified pre-immune IgG fraction did not cause any staining. 
Analyses with a TRX-specific antibody revealed a similar staining 
pattern, indicating that redhBD-1 and TRX co-localize and thereby 
strengthened the hypothesis that TRX is a physiological mediator 
catalysing reduction of hBD-1 in human epithelia. 

In inflamed tissue from rodent models, application of recombinant 
thioredoxin has a beneficial effect and ameliorates colitis”’. We there- 
fore studied TRX mRNA in human inflammatory bowel disease and 
found it to be decreased in inflamed colonic tissue (Supplementary Fig. 
8). It remains unclear whether this relative lack of thioredoxin could 
perpetuate inflammation via a reduction of intestinal antibiotic barrier 
function. Unfortunately, due to differences in the antimicrobial pep- 
tide repertoire in rodents compared to humans definite proof of such a 
concept will be difficult to obtain. 

Anaerobic niches can be found in cutaneous sweat- and sebaceous 
glands, wounds, infectious sites as well as mucosal membranes of 
intestine, vagina, oral cavities and others**’’. It seems plausible that 
hBD-1 and thioredoxin are constitutively produced by these epithelia 
to provide a hostile environment for (facultative) anaerobic pathogens 
or commensals. A disease relevance of these mechanisms appears 
likely, since single nucleotide polymorphisms in the hBD-1-encoding 
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Pre-immune IgG 
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pre-immune IgG fraction (right). Staining of human colon (upper row), ileum 
(middle row) and skin sections (bottom row) are shown. In the pre-immune 
IgG-treated skin (bottom row, right), brownish cells in the basal epidermal 
layer are melanocytes. Scale bars, 50 jim. 


DEFB1 promoter region have been associated with increased risk of 
caries, periodontitis, Candida infection and Crohn’s Disease**”*. 

Because antimicrobial activity of hBD-1 was found to be comparably 
low'®” it was paradoxical why an organism produces high amounts of 
an ineffective defence molecule. Here we show that reducing disul- 
phide bonds unmasks potent antimicrobial activity of hBD-1. 
Although its definite in vivo relevance remains to be demonstrated, 
these findings provide a new mechanism: redox modulation depend- 
ing on enzymatic and environmental factors strongly augments anti- 
biotic killing, a finding which opens new avenues in understanding the 
complex process of mucosal as well as skin host protection against 
commensals and pathogens. 


METHODS SUMMARY 

Antimicrobial assay. Antimicrobial assay was modified from ref. 12. Bacteria 
(except E. coli K12) were grown anaerobically and the assay was performed under 
anaerobic conditions with antimicrobial peptides or its variants/fragments and 0, 1 
and 2mM DTT. 

Generation of recombinant hBD1 and its variants. Generation of hBD-1 and its 
variants is described in the Supplementary Methods and Supplementary Table 2. 
Nuclear magnetic resonance (NMR) spectroscopy. Uniformly '°N-labelled 
reduced and oxidized hBD-1 was solubilised in H,O containing 10% DO to a 
final concentration of 0.5-0.7 mM. All spectra were recorded at 298 K. 

Circular dichroism (CD) spectroscopy. CD spectroscopy was carried out at 
25°C in 10 mM sodium phosphate, pH 7.4. Spectra were recorded at 0.1 mg ml! 
in the far-ultraviolet range. 

Transmission electron microscopy of bacteria. Bifidobacterium adolescentis was 
incubated with 200 jg ml’ synthetic defensin. Bacteria were fixed in Karnovsky’s 
fixative, embedded in agarose, coagulated, cut in small blocks and fixed again in 
Karnovysky’s solution. After post-fixation and embedding in glycid ether blocks 
were cut using an ultra microtome. Sections (30nm) were mounted on copper 
grids and analysed using a Zeiss LIBRA 120 transmission electron microscope. 
Generation of anti-reduced-hBD-1-specific antibodies. Generation of anti- 
reduced-hBD-1-specific antibodies and its use for immunohistochemistry and 
dot blot is described in Methods and Supplementary Fig. 7. 

Western blot analysis. Western blot analysis of reduced/alkylated hBD-1 is 
described in ref. 30 and Methods. 

Thioredoxin reduction assay. Thioredoxin reduction assays were performed 
analogous to ref. 20. Briefly, oxhBD-1 was incubated with 0.5-2.0 1M thioredoxin 
or protein disulphide isomerase in the presence of 100 nM thioredoxin reductase 
and 0.8mM NADPH in 0.1M potassium phosphate buffer-2mM EDTA. 
Incubation mixtures were analysed with RP-HPLC. For spectrophotometric 
disulphide reduction assay similar conditions were used in a volume of 100 ul, 
absorbance decrease was monitored at 340 nm. 
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Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Bacterial and fungal strains. Bacterial strains B. adolescentis Ni3,29c (clinical 
isolate), B. adolescentis DSM20038T (reference strain from German Collection of 
Microorganisms and Cell Cultures (DSM, Germany), B. adolescentis PZ 4009 
(clinical isolate), Bifidobacterium breve DSM20213T (DSM reference strain), 
B. breve PZ 1343 (from probiotic VSL#3), B. breve Ha6/14c (clinical isolate), 
Bifidobacterium longum DSM 20219T (clinical isolate), B. longum So2/88b2a 
(clinical isolate), Lactobacillus acidophilus PZ 1138 (clinical isolate), and 
Lactobacillus fermentum PZ 1162 (clinical isolate) were obtained from 
Ardeypharm (Germany) and Bacteroides vulgatus DSM1447 was obtained from 
the German Collection of Microorganisms and Cell Cultures (DSM). Candida 
albicans strains 143, 146 and 526 were isolated from faeces, strains 277 and 828 
from tracheal secretions and provided by the Institute of Laboratory Medicine, 
Klinik am Eichert (Géppingen, Germany). 

Antimicrobial assay. Antimicrobial radial diffusion assay for anaerobic bacteria 
was modified from ref. 12 (see refs 31 and 32). Bacteria were grown anaerobically 
(AnaeroGen, Oxoid) for 24h at 37 °C on Columbia Blood agar plates, inoculated 
into liquid trypticase soy broth (TSB) medium and cultivated for another 24h. 
Bacterial cultures were washed and diluted to attenuance (De20nm) = 0.1, 150 ul 
were used for killing assay. Incubation was carried out in 10 ml of 10 mM sodium 
phosphate containing 0.3 mg ml‘ of TSB powder and 1% (w/v) low EEO-agarose 
(Sigma-Aldrich) with 0-2 mM DTT (Sigma-Aldrich) under anaerobic conditions 
with synthetic hBD-1, hBD-3 (both Peptide Institute), lysozyme (Sigma-Aldrich), 
hBD-1 variants (recombinantly expressed as described) or synthetic C-terminal 
heptapeptides (emc microcollections GmbH). Reducing milieu was monitored by 
addition of redox indicator resazurin (1 mgl~’, Sigma-Aldrich). Reduced hBD-1 
was obtained by reduction with 20mM DTT, purification with RP-HPLC as 
described, drying with vacuum-centrifuge and storage under argon gas at —20 °C. 
An overlay-gel containing 6% (w/v) TSB powder, 1% agarose and 10 mM sodium 
phosphate buffer (pH 7.4) with or without DTT was poured onto the plates after 3 h 
and after incubation for 48 h at 37 °C (Bifidobacterium bifidum was incubated for up 
to 4days) the diameter of inhibition zones was measured and in part statistically 
evaluated using GraphPad Prism 4.03 (Graphpad Software) and Student's t-test with 
*P <0.05, **P < 0.01, ***P < 0.001. 

Matrix-assisted laser desorption/ionization mass spectroscopy (MALDI-MS). 
hBD-1 (1g) was incubated with different concentrations of DIT in 10mM 
sodium phosphate buffer, pH 7.4, for 30 min at 37 °C, followed by alkylation with 
20 mM iodoacetamide for 30 min at 25°C and co-crystallized with o-cyano-4- 
hydroxy cinnamic acid. MALDI-MS was carried out at an ultraflex TOF/TOF 
machine (Bruker). 

High performance liquid chromatography (HPLC) analysis. hBD-1 (1 1g) was 
incubated with different concentrations of DTT in 10 mM sodium phosphate buf- 
fer, pH 7.4, for 30 min at 37 °C. HPLC analysis was carried out with an Agilent 1200 
series system (Agilent) and a Vydac 218TP-C18 Column (250 X 4.6mm, 5 um, 
Grace). Gradient increased from 2% B to 35% B in 33 min (solvent A, water + 
0.18% (v/v) trifluoroacetic acid (TFA); solvent B, acetonitrile + 0.15% (v/v) TFA) 
at 25°C and 0.8 ml min |. 

Generation of recombinant hBD1 and its variants. Generation of recombinant 
hBD-1 and its variants is described in the Supplementary Methods part and 
Supplementary Table 2. 

Nuclear magnetic resonance (NMR) spectroscopy. Lyophilized uniformly '°N- 
labelled reduced and oxidized hBD-1 was solubilised in HO containing 10% DO 
to a final concentration of 0.5-0.7mM. The reduced peptide was kept under 
nitrogen atmosphere during measurement. All spectra, the 'H-!°N-heteronuclear 
single quantum coherence (HSQC), the heteronuclear edited NNH-nuclear 
Overhauser enhancement spectroscopy (NOESY), the conventional '"N-HSQC- 
NOESY and the 'H-'°N-heteronuclear NOE spectrum, were recorded at 298 K on 
a Bruker AVIII-600 spectrometer (Bruker). 

Circular dichroism (CD) spectroscopy. CD spectroscopy was carried out on a 
Jasco J-715 spectropolarimeter (Jasco Corporation, Japan). Reduced hBD-1 pep- 
tide was obtained by reduction with 20mM DTT and purification with HPLC 
analysis as described. Reduced peptide was kept under argon atmosphere until CD 
measurement and spectra were recorded in 10mM sodium phosphate buffer, 
pH7.4 at a concentration of 0.1 mg ml‘ in the far-ultraviolet range at 25°C. 
Spectra were deconvoluted with CDSSTR and Selcon3 algorithms. 

Flow cytometric antimicrobial assay. Flow cytometric antimicrobial assay mea- 
suring membrane depolarization of bacteria and fungi was carried out as described 
elsewhere”, Briefly, 1.5 X 10° cells per ml were incubated in 1:6 diluted Schaedler 
broth at 37 °C with hBD-1 peptide in a final volume of 50 ul. hBD-3 served as 
positive control. Defensins were dissolved in 0.01% acetic acid and added to 
bacterial/ fungal suspensions at indicated final concentrations (40 1g ml‘ for 
C. albicans). Bacterial or fungal suspensions incubated with solvent (0.01% acetic 
acid) served as controls for viability. After 30 and 90 min the suspensions were 


incubated for 10min with 1mgml’ of the membrane potential sensitive dye 
[bis-(1,3-dibutylbarbituric acid) trimethine oxonol] (DiBAC4(3)) (Invitrogen, 
USA). Suspensions were centrifuged and the sediments were resuspended in 
300 pl phosphate-buffered saline. The percentage of depolarized fluorescent bac- 
teria or fungi in suspension was determined by a FACSCalibur flow cytometer 
(BD, USA) using Cell Quest software (BD) for 30 000 events per sample. 
Transmission electron microscopy of bacteria. Approximately 2X 10° CFU of 
Bifidobacterium adolescentis Ni3,29c were incubated for two hours at 37 °C in an 
anaerobic jar in 10 mM sodium phosphate buffer (pH 7.4) containing 0.3 mg ml’ 
of TSB-powder in the absence or presence of 200 jg ml ' synthetic oxidized hBD- 
3, oxidized hBD-1 (Peptide Institute Inc.) and reduced hBD-1. For transmission 
electron microscopy, bacteria were centrifuged, fixed in pre-warmed Karnovsky’s 
fixative for 1h at room temperature and stored at 4°C for 24h. After centrifu- 
gation, the sediment was embedded in 3.5% agarose at 37 °C, coagulated at room 
temperature, cut in small blocks and fixed again in Karnovsky’s solution for at least 
lh. Post-fixation was based on 1.0% osmium tetroxide containing 1.5% 
K-ferrocyanide in 0.1 M cacodylate buffer for 2h. After embedding in glycid ether 
the blocks were cut using an Ultracut microtome (Reichert, Austria). Ultra-thin 
sections (30 nm) were mounted on copper grids and analysed using a Zeiss LIBRA 
120 transmission electron microscope (Carl Zeiss, Germany) operating at 120 kV. 
Generation of anti-reduced-hBD-1-specific antibodies. Polyclonal anti-red- 
hBD-1 antibodies were generated in rabbits against a mixture of reduced and 
alkylated recombinant full-length hBD-1 as antigens. For each rabbit, a total of 
0.9 mg of a protein mixture including 600 ug of HPLC-purified carboxamido- 
methylated hBD-1 and 300 ug of HPLC-purified reduced hBD-1 was conjugated 
to glutaraldehyde-treated maleimide-activated keyhole limpet haemocyanin 
(KLH) (protein-KLH 1:1, w/w) in phosphate buffered saline (PBS, pH 7.2) and 
used as immunogens. Immunization was carried out four times on days 0, 14, 28 
and 35. Rabbits were bled 2 weeks after the last booster. Antisera were first applied 
to a HiTrap Protein G HP column (GE Healthcare, Germany) to separate IgG 
fraction. Next, antibodies were further purified by affinity chromatography using 
an alkylated hBD-1-column which was prepared from 1 mg highly HPLC-purified 
alkylated hBD-1-pet32-fusion protein that was covalently bound to a HiTrap 
N-hydroxysuccinimide (NHS)-activated HP 1 ml column (GE Healthcare). To 
deplete any cross-reacting antibodies recognizing the correctly folded oxhBD-1, 
the purified antibodies were further applied to an ox-hBD-1-column, which was 
generated from a HiTrap NHS-activated HP 1 ml column, where correctly folded 
oxhBD-1 (prepared as described) was covalently bound. Specificity was tested by 
immunodot and western blot analyses (Supplementary Fig. 7). As antigens alkylated 
hBD-1, synthetic, correctly folded oxhBD-1, purified recombinant oxhBD-1 and 
freshly prepared redhBD-1 were used. Upon immunodot and western blot-analyses 
purified antibody preparations recognized specifically redhBD-1 and alkylated 
(alk)hBD-1, but neither synthetic, correctly folded nor purified, recombinant 
oxhBD-1. In addition, pre-incubation of the antibody with alkylated hBD-1 pre- 
vented immunohistochemical staining (Supplementary Fig. 7). 

Thioredoxin reduction assay. Thioredoxin reduction assays were performed 
analogous to A. Holmgren”’. For RP-HPLC analysis 10 1M synthetic, oxhBD-1 
was incubated with 0.8 mM NADPH (biomol, Germany), 100 nM rat thioredoxin 
reductase (IMCO, Sweden), 0.0 - 1.5 uM human thioredoxin (Sigma-Aldrich) or 
up to 2.0uM bovine protein disulphide isomerase (PDI, Sigma-Aldrich) for 
30min at 37°C in 0.1 M potassium phosphate-2mM EDTA, pH7.0 buffer. 
Incubation mixtures were acidified with TFA, mixed with HPLC solvent and 
analysed with HPLC as described and conversion from oxidized into reduced 
hBD-1 peptide was followed by retention time. For spectrophotometric disulphide 
reduction assay similar conditions were used with the following exceptions: 
oxhBD-1 concentration was either 25 uM while TRX concentration was 0.5 - 
2.0 uM or hBD-1 concentration ranged from 12.5 to 100 uM while TRX concen- 
tration was constant at 2.0 uM. Absorbance decrease was monitored at 340 nm for 
up to 60 min at 37 °C in a 96-well plate reader and a final volume of 100 ul. All 
values were corrected against a control which contained buffer instead of hBD-1; 
experiments were repeated at least twice. 

Immunohistochemistry. Fixation of tissue samples and biopsies was performed in 
4% formalin containing 20 mM iodoacetamide to conserve open disulphide bonds 
and a protease-inhibitor cocktail (Complete, Roche, Germany). Paraffin sections 
(5 um) of tissue samples were deparaffinised and rehydrated before heat-induced 
antigen retrieval was performed in 0.01M citrate buffer (pH 6.0). Slides were 
blocked with 12% bovine serum albumin in Tris-buffered saline, pH 7.4, before 
staining. Immunohistochemical staining was performed at room temperature for 
one hour using either affinity-purified polyclonal rabbit reduced/alkylated-hBD-1- 
antibody (5-20 pg ml~ ) or rabbit thioredoxin antibody (intestinal sections, Dako, 
1:400) or goat thioredoxin antibody (skin sections, R&D Systems, 1:500). A bio- 
tinylated secondary pig anti-rabbit IgG antibody (1:300, Dako Cytomation) or 
rabbit anti-goat IgG antibody (1:500, Jackson Immuno Research Lab) was used, 
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followed by incubation with Vectastain ABC Kit Elite Pk-6100 (Vector, USA) 
developed with Vector NovaRED substrate kit for peroxidase Sk-4800 (Vector) 
and counterstained with hematoxylin. Specificity test of the reduced/alkylated 
hBD-1-antibody was performed by using protein G-purified pre-immune IgG 
(Fig. 4b) and blocking with alkylated hBD-1 (Supplementary Fig. 7). 

Immunodot blot analysis. Proteins (synthetic, correctly folded oxidized hBD-1 in 
the absence or presence of 2 mM (tris(2-carboxyethyl) phosphine (TCEP); recom- 
binantly expressed, refolded and HPLC-purified oxidized hBD-1 in the absence or 
presence of TCEP as well as HPLC-purified alkylated hBD-1) were dissolved in 
0.1% (v/v) formic acid and were dotted to a Protran-nitrocellulose membrane 
(Schleicher & Schuell BioScience, Germany), blocked for 1 h in 5% (w/v) nonfat 
powdered milk in PBS + 0.05% Tween (PBST) and incubated for 18 h at 4 °C in 3% 
(w/v) nonfat powdered milk in PBST containing 8.8 jg ml“! affinity-purified poly- 
clonal reduced/alkylated rabbit hBD-1-antibody. The membrane was washed with 
PBST six times for 5 min each, then incubated for 1 h in 3% (w/v) nonfat powdered 
milk in PBST containing 1:20.000 dilution of goat anti-rabbit IgG HRP conjugate 
(Dianova, Germany). After six washing steps the membrane was incubated for 
5 min with chemiluminescent peroxidase substrate (Sigma) and visualized using a 
Diana III cooled CCD-camera imaging system (Raytest, Germany). Densitometric 
quantifications were performed using AIDA evaluation software (Raytest). Quality 
control experiments for the reduced/alkylated-hBD-1-antibody in the immunodot 
system (performed by pretreatment of the antibody with 5 jig/ml alkylated hBD-1) 
revealed no staining with oxidized, reduced as well as alkylated hBD-1 (not shown). 
Western blot analysis. Samples were boiled in sampling buffer in the absence or 
presence of 2mM DTT as indicated and electrophoresed in a system optimized for 
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the analysis of 1-10 kDa-peptides as previously described for AMPs”. Briefly, 
samples were electrophoresed in a 16.5% SDS-tricine polyacrylamide gel contain- 
ing 8 M urea but no reducing agent. Proteins were transferred to a Protran- 
nitrocellulose membrane (Schleicher & Schuell BioScience, Germany) and then 
treated analogue to immunodot analysis. 

Isolation of total RNA and real-time PCR. Frozen biopsies were disrupted mech- 
anically and total RNA was isolated using TRIzol reagent according to the supplier’s 
protocol. RNA was reverse transcribed with Superscript II reverse transcriptase 
(Invitrogen) into cDNA, according to the standard protocol of the manufacturer. 
cDNA derived from 10 ng of total RNA served as a template for real-time PCR 
reaction. The expression levels of thioredoxin mRNA were quantified by real-time 
PCR using a fluorescence detection monitor (Light-Cycler; Roche Diagnostics) using 
oligonucleotides as shown in Supplementary Table 2. All expression levels were 
normalized with respect to the expression of beta-actin. To calculate statistically 
significant differences between groups, samples were analysed by Mann-Whitney 
test. Gaussian distribution was determined using the Kolmogorov-Smirnov test. 
Results are presented as mean + s.e. Values of P< 0.05 were considered to be stat- 
istically significant. Data were analysed using Graphpad Prism version 4. 
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Atomic-level modelling of the HIV capsid 


Owen Pornillos'*, Barbie K. Ganser-Pornillos'’* & Mark Yeager’? 


The mature capsids of human immunodeficiency virus type 1 
(HIV-1) and other retroviruses are fullerene shells, composed of 
the viral CA protein, that enclose the viral genome and facilitate its 
delivery into new host cells’. Retroviral CA proteins contain inde- 
pendently folded amino (N)- and carboxy (C)-terminal domains 
(NTD and CTD) that are connected by a flexible linker’*. The 
NTD forms either hexameric or pentameric rings, whereas the 
CTD forms symmetric homodimers that connect the rings into a 
hexagonal lattice**"*. We previously used a disulphide crosslink- 
ing strategy to enable isolation and crystallization of soluble HIV-1 
CA hexamers’”"*. Here we use the same approach to solve the X-ray 
structure of the HIV-1 CA pentamer at 2.5A resolution. Two 
mutant CA proteins with engineered disulphides at different 
positions (P17C/T19C and N21C/A22C) converged onto the same 
quaternary structure, indicating that the disulphide-crosslinked 
proteins recapitulate the structure of the native pentamer. 
Assembly of the quasi-equivalent hexamers and pentamers 
requires remarkably subtle rearrangements in subunit interac- 
tions, and appears to be controlled by an electrostatic switch that 
favours hexamers over pentamers. This study completes the gallery 
of substructures describing the components of the HIV-1 capsid 
and enables atomic-level modelling of the complete capsid. Rigid- 
body rotations around two assembly interfaces appear sufficient to 
generate the full range of continuously varying lattice curvature in 
the fullerene cone. 

By analogy to quasi-equivalent icosahedral virus capsids, the sub- 
units of the conical fullerene capsid of HIV-1 are organized on a 
hexagonal lattice that requires insertion of exactly 12 pentamers to 
close the shell®’*’”. However, the pentamers in an icosahedral capsid 
are in a symmetric configuration at the vertices, whereas pentamers in 
the fullerene cone are distributed asymmetrically, with five at the 
narrow end and seven at the wide end’*. The Rous sarcoma retrovirus 
(RSV) CA protein can form icosahedrally symmetric particles in vitro, 
and electron cryomicroscopy maps at 10 A resolution indicate that the 
hexamer and pentamer are indeed quasi-equivalent; that is, RSV CA 
forms both oligomers using the same interacting surfaces””’. 

We previously used a disulphide crosslinking strategy to facilitate 
purification and structure determination of HIV-1 CA hexamers!?”*. 
To explore the molecular basis of retroviral CA quasi-equivalence, we 
have now determined X-ray structures of two disulphide-crosslinked 
HIV-1CA pentamers at 2.5 A and6 A resolution (Supplementary Figs 1 
and 2, and Supplementary Table 1). The independent pentamer struc- 
tures were closely superimposable (Supplementary Fig. 3). Therefore 
the crosslinked proteins likely recapitulate the native pentameric archi- 
tecture of HIV-1 CA. The structures were also similar to the lower- 
resolution structures of the RSV CA pentamer””’. 

Comparison of the new pentamer structures with the HIV-1 CA 
hexamer structures*’”"* confirms that the two oligomers are highly 
related (Fig. 1). In both cases, the CTDs form a ‘belt’ (blue in Fig. 1a, d) 
surrounding the inner ring of NTDs (orange). The NTD and CTD of 
each CA molecule cradle the NTD from the neighbouring subunit 
(illustrated for the pentamer in Fig. 1b, c; for the hexamer in Fig. le, 
f; and for both oligomers in superposition in Fig. 2a). Intermolecular 


NTD-NTD interactions facilitate formation of the NTD rings, 
whereas NTD-CTD interactions hold the CTD subunits in the ‘belts’ 
against the inner NTD rings. There are no intramolecular interactions 
between the NTD and CTD of each subunit, apart from the covalent 
peptide linkage between these domains (red arrowhead in Fig. 2a). The 
ability of the inherently flexible linker to adopt different conformations 
facilitates appropriate juxtaposition of the same interaction surfaces in 
both the pentamer and hexamer. 

Previous analysis indicated that the hexameric NTD ring closely 
obeys sixfold rotational symmetry and is relatively rigid, whereas the 
CTD subunits in the ‘belts’ are mobile and can rotate relative to the 


Figure 1 | Structure of the disulphide-stabilized HIV-1 CA pentamer and 
comparison with the hexamer. a, Top view of the pentamer, with the NTD 
coloured in orange and the CTD in blue. Helices are represented as cylinders. 
b, c, Top view (b) and side view (c) of the pentamer, with the helices as ribbons. 
Each subunit is in a different colour. d—f, Equivalent views of the hexamer 
(Protein Data Bank accession number 3H4E)"'. The yellow spheres in a and 
d indicate the positions of the pentamer-stabilizing (N21C/A22C) and 
hexamer-stabilizing (A14C/E45C) disulphide bonds, respectively. 
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Figure 2 | Comparison of the pentamer and hexamer interactions. Both 
oligomers are created by quasi-equivalent packing of each of the two domains 
of one CA subunit (coloured in green) with the NTD ofa second subunit (blue). 
a, Stereo view superposition of the pentamer interface (dark colours) and the 
hexamer interface (light colours). The structures are superimposed on the blue 
NTD. b, Close-up view of representative NID-CTD contact regions in the 
pentamer and hexamer. Key residues are shown explicitly and labelled, with 


NTD ring". Each CTD pivots as a rigid body about four intermolecular 
helix-capping hydrogen bonds at the NID-CTD interface. With these 
motions, each hexameric ring can adopt slightly different dihedral 
angles relative to its adjacent rings in the capsid lattice?!”. 
The NTD-CTD interfaces within the pentamer and hexamer are 
remarkably similar (Fig. 2b). Indeed, comparison of the crystallogra- 
phically distinct pentamers reveals the same type of flexibility as 
the hexamer (that is, the fivefold symmetric NTD ring is apparently 
rigid, and the CTDs are more mobile) (Fig. 2c and Supplementary 
Fig. 3). These results indicate that the pentamer and hexamer use the 
same mechanism to accommodate local variations in capsid lattice 
curvature. 

NTD ring interactions are mediated by the first three a-helices of 
each subunit, which form a 15-helix barrel in the pentamer (Fig. 3a) 
and an 18-helix barrel in the hexamer (Fig. 3b). There are subtle 
differences in the repeating set of NID-NTD contacts, which com- 
prise a three-helix bundle, with helix 2 of one subunit (orange in 
Fig. 3c) packed lengthwise against helices 1 and 3 of the adjacent 
subunit (blue). Aliphatic sidechains at the centre of the bundle form 
a small hydrophobic core, whereas polar residues at the periphery 
participate in hydrophilic interactions. As noted previously, direct 
hydrophilic protein-protein contacts are conspicuously absent in the 
hexameric NTD ring, and essentially all the intersubunit hydrogen 
bonds are bridged by ordered water molecules’. In the pentamer 
structure, ordered waters were not modelled (see Methods), but exami- 
nation of residual difference density indicates that the assembly inter- 
faces are likewise solvated (not shown). 

The fivefold and sixfold symmetric NTD rings are distinguished 
through the angle subtended by adjacent subunits (72° in the pentamer 
and 60° in the hexamer, with the angle vertex at the centre of each ring) 
(Fig. 3a, b). To a first approximation, this difference is accommodated 
by a simple rotation of the subunits relative to each other (Fig. 3d). 
Remarkably, the rotation axis appears to coincide with the centre of the 
three-helix bundle (red dot in Fig. 3d), which allows essentially equi- 
valent packing of the aliphatic residues. Thus the hydrophobic NTD- 
NTD interactions are conserved in the two rings. In contrast, the polar 
atoms at the outer edges of the three-helix bundle display substantially 
different interatomic distances, but compensatory movements of 
water molecules appear to maintain ring-stabilizing hydrogen bonds 
(not shown). Our structures therefore indicate that switching between 
the CA hexamer and pentamer occurs through subtle changes in inter- 
subunit bonding interactions, and follow the principles of quasi- 
equivalence as originally envisioned by Caspar and Klug”’. 

Pentamer formation brings charged residues at the centre of the ring 
in close proximity, inducing both attractive and repulsive ionic inter- 
actions. This implies that electrostatic forces control switching 


LETTER 


independent NTD-CTD interfaces in the pentamer, superimposed on the 
NTD. Flexion of the two domains is indicated by the black double-headed 
arrow, and occurs about molecular pivots composed of helix-capping 
intermolecular hydrogen bonds. The relevant side chains are shown explicitly 
and labelled, and hydrogen bonds are indicated by yellow lines. 


between the pentamer and hexamer, as was suggested for RSV CA’. 
For the HIV-1 CA oligomers, the points of closest approach occur at an 
annulus at the top of helix 1, which is occupied by an arginine residue 
(Arg 18). The arginines appear well accommodated within the annulus 
of the hexamer, but are more closely apposed in the pentamer 
(Supplementary Fig. 4). Closer juxtaposition of like charges is expected 
to create stronger electrostatic repulsion, which is consistent with the 
biochemical observation that assembly of HIV-1 CA pentamers is 


° 
[3 


Approximate 
pivot point 


Approximate position 
of fivefold axis 


Figure 3 | Quasi-equivalence in the pentameric and hexameric NTD rings. 
a, b, Top views of the pentameric (a) and hexameric (b) NTD rings, with each 
subunit in a different colour. Subunits in the pentamer and hexamer are shown 
in darker and lighter shades, respectively. The angles subtended by adjacent 
domains are shown explicitly for the blue and orange subunits. One of the 
repeating three-helix units is outlined in black. c, Close-up view of the 
pentameric and hexameric repeat units, superimposed on helices 1 and 3 of the 
blue subunit. The aliphatic residues that form a small hydrophobic core are 
shown explicitly and labelled. d, The ‘rotation’ between adjacent subunits, in 
going from the hexamer to the pentamer. The approximate position of the 
rotation axis is indicated by the red dot. Note that this axis is parallel to neither 
the pentameric nor hexameric symmetry axes. 
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Figure 4 | Model of the HIV-1 capsid. a, Stereo view of a backbone-only 
fullerene cone model composed of 1,056 CA subunits. The hexamers, 
pentamers and dimers are coloured in orange, yellow and blue, respectively. 
Note that the capsid displays a variably curved surface. In the body of the cone, 
curvature changes continually, which was modelled by subunit flexion at the 
NTD-CTD interface. Pentamers alter the trajectory of the hexagonal lattice and 
create regions of sharp curvature (that is, declinations). Exactly 12 declinations 
are required to close a hexagonal lattice. Our modelling suggests that formation 
of the declinations entails a flexible CTD dimer. Note also that the CTD 
subunits surrounding the local threefold axes are in close proximity, consistent 


disfavoured relative to hexamers. Elimination of the charge is expected 
to favour pentamer formation; indeed, mutation of Arg 18 into alanine 
promotes assembly of highly curved particles (cones, spheres, spirals, 
short capped cylinders)”. In contrast, wild-type CA typically assembles 
into long tubes composed of hexamers”. Mutation of Arg 18 to 
valine, isoleucine or leucine induces assembly of spheres (that is, more 
efficient pentamer formation)*; we presume that this is because the 
larger aliphatic sidechains contribute stabilizing hydrophobic contacts. 

The importance of Arg 18 for the energetic landscape of HIV-1 CA 
assembly is consistent with its very high degree of conservation (99.8% 
of 2,460 sequences in the Los Alamos database, http://www hiv.lanl. 
gov/content/index). We propose that electrostatic destabilization of 
the pentamer is precisely counterbalanced by cooperative lattice sta- 
bilization, such that pentamers form and integrate into the assembling 
capsid only when required to relieve strain induced by local lattice 
curvature. We further speculate that the narrow end of the cone 
may be particularly susceptible to destabilization because this region 
has a high concentration of pentamers, and this may be relevant to 
capsid disassembly or uncoating. 

At this time, it is not possible to determine experimentally the atomic 
structure of the native HIV-1 capsid. Nevertheless, having on hand a 
complete gallery of high-resolution structures of the building blocks 
allowed us to model a fullerene cone capsid (Fig. 4a). In our modelling, 
the NTD hexamers, NTD pentamers and CTD dimers were treated as 
rigid bodies, and intersubunit distances across the NTD-CTD inter- 
faces were used as indicators of model quality (see Methods) (Fig. 4b). 

Within the body of the cone, the CA subunits were arranged on a 
hexagonal lattice with a unit cell spacing of approximately 93 A. Con- 
sistent with a previous proposal"', we found that the full range of vari- 
able lattice curvature in this region could be modelled by introducing 
small rigid-body rotations across the NTD-CTD interfaces, while 
keeping the NID-NTD hexamerization and CTD-CTD dimerization 
interfaces constant. In modelling the pentameric declinations, wherein 
the lattice curvature is most pronounced, we found that the CTD dimers 


426 | NATURE | VOL 469 | 20 JANUARY 2011 
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with the finding that this site constitutes a fourth set of capsid-stabilizing 
interactions'’. b, The extent by which the intersubunit distances across the 
modelled NTD-CTD interfaces deviate from the expected value, as a function 
of cone length (deviation = distancemodelled — distanceexpecteas Where 
distancegxpected = 9-0 A, which refers to the average separation of hydrogen- 
bonded pairs in the X-ray structures of the hexameric and pentameric NTD- 
CTD interfaces) (see Methods for details). Note that 99% of the NTD-CTD 
distances in the model are within 1 A of the expected value. This suggests that 
the model is of good quality, in light of the sizeable number of constraints 
imposed on the subunit interactions. 


must span different distances when connecting hexamers to hexamers 
(approximately 33 A) and hexamers to pentamers (approximately 26 A) 
(Supplementary Fig. 5a). These distances are in close correspondence 
to the dimensions of two independently determined CTD dimer struc- 
tures, 2KOD (ref. 12) and 1A43 (ref. 5), and suggests a rationale for the 
seemingly disparate structures of the dimers. Although the structures 
were solved from slightly different protein constructs, both retained 
the dimerization affinity of full-length CA*"*. However, each exhibited 
a distinct subunit packing geometry across the dimer dyad (Sup- 
plementary Fig. 5b). Therefore the 2KOD dimer was used to connect 
hexamers to hexamers, and the 1A43 dimer was used to connect 
hexamers to pentamers. Models wherein the NTD rings were con- 
nected by either dimer alone displayed significant backbone clashes 
(2KOD) or relatively large separations (1A43) between subunits sur- 
rounding the declinations (not shown). This suggests that rotation or 
slippage at the CID-CTD interface may be a mechanistic element of 
capsid assembly, which is consistent with studies showing that the 
CTD has a flexible architecture*””’. 

It is remarkable that our simple modelling approach, which allowed 
just two types of rigid-body rotation between the building blocks, 
produced a fullerene model wherein essentially all the subunits dis- 
played reasonable packing geometries. On the basis of this analysis, we 
conclude that CA assembly entails flexibility at both the NTD-CTD 
interface and the dimer interface to generate the constantly varying 
lattice curvature in the HIV-1 capsid. 


METHODS SUMMARY 


Soluble, disulphide-crosslinked pentamers of HIV-1y14-3 CA (containing either 
N21C/A22C/W184A/M185A or P17C/R1I8L/T19C/W184A/M185A mutations) 
were prepared by sequential dialysis of purified protein. Crystals were obtained 
by the sitting-drop vapour diffusion method in Tris-buffered precipitant solutions 
containing polyethylene glycol and sodium iodide. Synchrotron diffraction data 
were processed with the program HKL2000. Molecular replacement phasing, 
model building and crystallographic refinement were performed with the programs 
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MOLREP, Coot and PHENIX. The capsid model was built by manual rigid-body 
docking of the high-resolution structures of the fivefold symmetric NTD ring 
(Protein Data Bank accession number 3P05), the sixfold symmetric NTD ring 
(3H47) and twofold symmetric CTD dimers (2KOD and 1A43) into a geometric 
fullerene cone model. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Initial characterization of HIV-1 CA pentamer constructs. Protein expression 
and purification protocols have been described in detail'''*. The disulphide- 
stabilized CA pentamers were serendipitously generated in a previous screen for 
disulphide-stabilized hexamers'’"*. In the hexamer study, we engineered several 
pairs of cysteine residues in the NTD and assayed for pairs that efficiently oxidized 
into intermolecular disulphide bonds linking each subunit to its two neighbours in 
the hexameric ring. Hexamer formation and disulphide crosslinking were induced 
by functional assembly of the CA mutants into helical tubes, which are composed 
only of hexamers®'*. We identified two pairs of cysteine mutations (P17C/T19C 
and N21C/A22C) that instead induced assembly of CA spheres approximately 
35 nm in diameter (data not shown, but see Fig. 2 in ref. 14). This result implied that 
the assembling hexagonal lattice now incorporated pentamers*”"’. Consideration 
of the protein mass and volume suggested that these spheres have the expected 
size for a T = 3 icosahedral particle comprising 20 hexamers and 12 pentamers, 
although most of the particles lacked strict icosahedral order. Non-reducing SDS- 
polyacrylamide gel electrophoresis of the spheres was used to confirm that the 
PI7C/T19C and N21C/A22C mutations indeed selectively stabilized the CA 
pentamer (see Fig. 2 in ref. 14). As in the hexamer case’’"“, we also introduced 
the W184A and M185A mutations to disrupt the CTD dimerization interface*”””* 
to prevent the crosslinked pentamers from polymerizing further (without affecting 
formation of the pentameric ring). 

Crystallization and data collection. The mutants that crystallized contained the 
following mutations: N21C/A22C/W184A/M185A and P17C/R18L/T19C/ 
W184A/M185A. The R18L mutation was added to the second construct because 
it promoted more efficient pentamerization (data not shown). Soluble pentamers 
were obtained by sequential dialysis of 10 mg ml | protein into assembly buffer 
(50mM Tris pH 8, 1M NaCl) containing 100 mM f-mercaptoethanol (BME), 
assembly buffer with 0.2 mM BME and, finally, 20 mM Tris pH 8, 40mM NaCl 
without BME. All dialysis steps were performed at 4 °C. 

Soluble pentamers were crystallized in sitting drops, by mixing protein and 

precipitant at a ratio of 2:1. N21C/A22C-stabilized pentamers crystallized in 
0.2M sodium iodide, 0.1M Tris pH 8-9, 27-30% (w/v) PEG 4,000. P17C/ 
T19C-stabilized pentamers crystallized in 0.4M sodium iodide, 0.1M Tris pH 
7-8, 30-32% (w/v) PEG 2,000 MME. Parallelepiped-shaped crystals appeared 
after approximately 3 days at 25 °C and displayed maximal growth (approximately 
400 um in the longest dimension) in 2 weeks. The crystals were very osmotically 
sensitive. Fragments suitable for data collection were obtained by transferring the 
large crystals into mother liquor containing 20-30% (v/v) glycerol and applying 
gentle pressure on the crystals’ surface with a nylon loop. Under favourable con- 
ditions, this resulted in an ‘explosion’ of the crystals into smaller fragments 
approximately 50 1m in size. The fragments were then rapidly mounted and 
flash-frozen in liquid nitrogen. About 1 in 50 fragments gave suitable diffraction 
data, which were indexed, merged and scaled with HKL2000 (ref. 29). 
Structure determination of N21C/A22C-stabilized pentamers. Diffraction data 
to 2.5 A resolution were collected at the Advanced Photon Source beamline 22-BM. 
The crystals belonged to space group P2,, with five CA molecules in the asymmetric 
unit (Supplementary Fig. 1). The self-rotation function showed a strong fivefold 
non-crystallographic symmetry (NCS) axis, indicating that the CA proteins were 
arranged as a pentameric ring. Five NTDs and five CTDs were positioned 
separately with MOLREP*. Model building was performed with Coot*', using 
NCS-averaged maps calculated separately for the NTD and CTD, and a bias- 
minimized map calculated with the Shake&wARP algorithm’. Simulated anneal- 
ing refinement was performed with PHENIX”’, with the two domains defined as 
separate NCS groups. The test set was assigned using the thin shell method, as 
implemented in the program DATAMAN™. The model has good geometry and no 
residues in disallowed regions of the Ramachandran plot**”®. Structure statistics are 
summarized in Supplementary Table 1. 

The asymmetric unit contained several well-ordered iodide atoms, whose posi- 
tions were confirmed with an anomalous difference density map calculated from a 
data set collected at the Advanced Light Source beamline 5.0.2 (wavelength = 1.8 A). 
Smeared anomalous density peaks were also observed at the centre of the NTD 
barrel, but the atomic positions were difficult to pinpoint, and these iodide densities 
were therefore left unmodelled (not shown). Furthermore, the iodide ions appear to 
have perturbed the water-mediated hydrogen-bonding network; thus we did not 
model water molecules. We examined putative water densities in mF, — DF. maps 
calculated with phases from the final model and no NCS averaging. We found that 
many of the residual peaks overlapped with, or had clear correspondence to, water 
positions in the CA hexamer structures (not shown). We therefore believe that, like 


the hexamer, the pentamerization interface includes significant contributions from 
water-bridged hydrogen bonds. 

Structure determination of P17C/T19C-stabilized pentamers. Diffraction data 
to 6 A resolution were collected at the Advanced Photon Source beamline 22-ID. 
The crystals belonged to space group P1, with four pentamers in the unit cell 
(Supplementary Fig. 2). This structure was solved by molecular replacement with 
MOLREP”, using the N21C/A22C pentamer as a search model. Owing to the 
limited resolution, refinement of the P17C/T19C structure in PHENIX* treated 
each NTD and CTD as separate rigid units (Supplementary Table 1). 
Modelling of the mature capsid. Capsid modelling was performed by rigid-body 
docking of the high-resolution structures of the HIV-1 CA hexamer (3H47), pentamer 
(N21C/A22C structure from this study) and dimers (2KOD/NMR and 1A43/X-ray) 
into a geometric model of a fullerene cone (Fig. 5d, e in ref. 19). Because the geometric 
model was built from planar hexagons and pentagons’””, we first defined a corres- 
ponding plane in the hexamers and pentamers for alignment. The hexamer plane was 
defined as a slab, perpendicular to the sixfold symmetry axis, which contained the 
calculated pivot points for flexion at each of the NID-CTD interfaces. The pivot 
point, in turn, was estimated by analysis of the crystallographically independent 
NTD-CTD interfaces in the CA hexamer structures with the program 
HINGEFIND”. Docking was then performed by least-squares alignment of marker 
atoms with the corners of the hexagons, which corresponded to the threefold sym- 
metry axes of the hexagonal lattice. The pentamer was treated in the same manner. At 
this point, the model consisted of full-length CA hexamers and pentamers in which 
the sixfold and fivefold NTD-NTD and NTD-CTD interfaces were invariant and the 
CTD dimer interfaces were distorted. The distorted dimers were then replaced by the 
symmetric CTD dimer structures. This was performed by alignment of the dimers 
through least-squares superposition on the helix 8 Cx atoms: that is, in a manner that 
approximated flexion at the NTD-CTD interface. The final model comprises 166 
NTD hexamers, 12 NTD pentamers, 60 1443 CTD dimers (to connect hexamers to 
pentamers) and 468 2KOD dimers (to connect hexamers to hexamers). 

The X-ray structures of the HIV-1 CA hexamer and pentamer revealed an 
invariant set of helix-capping intermolecular hydrogen bonds at the NTD-CTD 
interface (Arg 173:Nn - Val 59:0; Arg 173:Nn - Asn 57:0; Ala 64:N — Glu 166:O¢; 
and Leu 211:N - Glu 71:0) (Fig. 2c). These interactions were not treated explicitly 
during assembly of the in silico cone model. Therefore the extent by which the 
modelled distances between these pairs of residues recapitulate the experimentally 
observed distances can be used as a gauge of model quality. The average Ca—Cor 
distance for the four pairs was calculated for each NTD-CTD interface (range 7.2- 
10.2 A, overall average for 1,046 interfaces 8.9 A), and compared with the average 
value in the X-ray structures (range 8.7-9.4 A, average 9.0 A) (Fig. 4b). Deviations 
for 99% of the modelled NTD-CTD interfaces were within 1 A of experimental. We 
judge that this level of quality assessment is sufficient at this stage of the modelling. 

As noted previously’’, the NTD-CTD interface is a narrow band of contacts 
mediated primarily by flexible polar side chains and water-mediated hydrogen 
bonds. Although there is a small hydrophobic component, these interactions are 
mediated by flexible aliphatic side chains (for example Met 68, Met 144 and 
Met 215). Thus geometric fit and surface complementarity are expected to be poor 
indicators of model quality in this case. 
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General anaesthetics have enjoyed long and widespread use but 
their molecular mechanism of action remains poorly understood. 
There is good evidence that their principal targets are pentameric 
ligand-gated ion channels’” (pLGICs) such as inhibitory GABA, 
(y-aminobutyric acid) receptors and excitatory nicotinic acetylcholine 
receptors, which are respectively potentiated and inhibited by general 
anaesthetics. The bacterial homologue from Gloeobacter violaceus’ 
(GLIC), whose X-ray structure was recently solved*”, is also sensitive 
to clinical concentrations of general anaesthetics®. Here we describe 
the crystal structures of the complexes propofol/GLIC and desflurane/ 
GLIC. These reveal a common general-anaesthetic binding site, which 
pre-exists in the apo-structure in the upper part of the transmem- 
brane domain of each protomer. Both molecules establish van der 
Waals interactions with the protein; propofol binds at the entrance of 
the cavity whereas the smaller, more flexible, desflurane binds deeper 
inside. Mutations of some amino acids lining the binding site pro- 
foundly alter the ionic response of GLIC to protons, and affect its 
general-anaesthetic pharmacology. Molecular dynamics simulations, 
performed on the wild type (WT) and two GLIC mutants, highlight 
differences in mobility of propofol in its binding site and help to 
explain these effects. These data provide a novel structural framework 
for the design of general anaesthetics and of allosteric modulators of 
brain pLGICs. 

Understanding the mechanism of action of general anaesthetics 
requires the identification of their binding site(s) within the three- 
dimensional structure of pLGICs. To identify such a site, we used 
the pH-gated bacterial homologue GLIC, a homopentameric member 
of the pLGIC family that was recently shown to be sensitive to general 
anaesthetics® and amenable to X-ray structure determination*”. 

Co-crystals of GLIC were grown with propofol and apo-GLIC crys- 
tals were equilibrated in mother liquor saturated with desflurane. 
Diffraction data were collected up to 3.2-A (desflurane) or 3.3-A (pro- 
pofol) resolution. In both structures, strong densities in otherwise 
empty Fourier F, — F, difference maps revealed bound anaesthetics 
in each subunit (mean peak height 8.8 + 0.60 for desflurane and 
5.9 + 0.80 for propofol; Supplementary Fig. 1). Both molecules were 
found to bind in the same region, with little change in the protein 
conformation compared with apo-GLIC’%, a feature observed also for 
some soluble proteins complexed with general anaesthetics’ ’. The 
binding site is located in the upper half of the transmembrane domain 
(Fig. 1a) in a cavity that exists within each subunit of the apo-structure. 
The general-anaesthetic cavity is accessible from the lipid bilayer and 
progressively narrows down towards the interior of the subunit 
(Fig. 1b, c). Another cavity of comparable volume is located at the 
interface between subunits, on the other side of M1. It is not accessible 
from the outside. A narrow tunnel (less than 3 A in diameter) links 
both cavities. 

Residues from a single subunit border the general-anaesthetic cavity 
(Fig. 2), with contributions from M1 (1201, 1202, M205 and L206), M2 


(the back wall, V242), M3 (Y254, T255, 1258 and 1259), M4 (N307 and 
F303, near the mouth), and from the B6-B7 loop (Y119, P120, F121, 
constituting the roof). Desflurane is buried deep inside the cavity and is 
engaged in mainly hydrophobic interactions with M1 (1201, 1202), M3 
(T255 and 1258) and M2 (V242). Its oxygen atom is within hydrogen- 
bond distance of the T255 hydroxyl group. Significant additional elec- 
tron density is observed in the protein neighbourhood and is attributed 
to lipids, with an alkyl chain obstructing the cavity entrance (Fig. 1b), 
as observed with the apo structure. This defines a cavity volume of 
238 A® whereas the volume of desflurane is 94 A®. Propofol lies closer 
to the entrance of the general-anaesthetic cavity and would clash with 
the lipid seen in the apo and desflurane structures. Accordingly, the 
presence of propofol is associated with local acyl chain reorganization. 
Propofol is sandwiched between M1 and M3 and interacts mainly with 
T255 and Y254, by van der Waals contacts (Fig. 2). In the orientations 
that best fit the density maps, the propofol hydroxyl group could form 
a hydrogen bond with Y254. Propofol lies 6 A away from the V242 side 
chain, whereas desflurane is 3.5 A away. 

We have recently shown that GLIC activation is inhibited by most 
general anaesthetics at clinical concentrations’. To check whether the 
general-anaesthetic binding sites contribute to this inhibition, we 
mutated key general anaesthetics-binding residues into alanine, or into 
more bulky residues (mutants 1202A,W,Y, V242M,W and T255A), 
and studied the functional effect by two-electrode voltage-clamp elec- 
trophysiology in oocytes. 

Among the mutants tested, 1202Y and T255A produce a marked 
gain of function, with a tenfold shift of the proton dose-response curve 
towards lower concentrations (pHso = 6.1 + 0.1 with Hill number 
(my) = 2.0+0.2 and pHs) = 6.0+ 0.2 with my = 1.1 40.2 respec- 
tively), compared with WT (pHso = 5.0 + 0.3 with my = 1.8 + 0.3) 
(Fig. 3a, b). T255A shows also slower apparent rate constants for 
activation and deactivation. The other mutants are activated by protons 
ina manner similar to WT, except that V242W yielded no current (Sup- 
plementary Table 2). 

The inhibitory action of general anaesthetics was measured around 
the one-fifth maximum effective concentration (ECz9) of proton 
activation. On WT, propofol and desflurane produced 100% maximal 
inhibition with half-maximum inhibitory concentration (ICs9) values 
of 24+6.3uM (nmy= 1.1402) and 27+13uM (ny =0.3 +02) 
respectively. Screening the mutants for inhibition by 10 uM propofol 
and 500 1M desflurane shows no change when position 202 is mutated, 
but shows that V242M and T255A produce a parallel tenfold shift of 
the propofol dose-inhibition curve to lower concentration (Fig. 3c, d 
and Supplementary Table 3). In contrast, V242M has no effect on 
desflurane inhibition, whereas T255A produces a tenfold shift of the 
desflurane inhibition curve towards higher concentrations. Measure- 
ment of general-anaesthetic inhibition at different pH shows that both 
anaesthetics are more efficient at higher pH, for both WT and T255A 
(Fig. 3e and Supplementary Fig. 2). Strikingly, T255A increases the 
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Figure 1 | Propofol and desflurane binding sites. a, General view of GLIC 
from the plane of the membrane in cartoon representation with a bound 
general-anaesthetic molecule in space-filling representation. The molecular 
surface is represented in the insets and coloured in yellow for the binding 
pocket. b, Cartoon and surface representation of the general-anaesthetic cavity 
seen from the membrane (left) and from the adjacent subunit (right, M1 
removed for clarity) with propofol (green), desflurane (yellow) and lipids of the 
two structures (green and orange respectively) depicted as sticks. For this 
representation Cx atoms were superimposed with a root mean square deviation 
of 0.13 A. ¢, Molecular surface of the general-anaesthetic intra-subunit cavities 
(yellow) and neighbouring inter-subunit cavities (pink) for the whole 
pentamer. In one of the subunits, the communication tunnel between the two 
cavities is depicted in orange, and its constriction indicated by an arrow in the 
inset. 
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inhibition by propofol but decreases the inhibition by desflurane at all 
proton concentrations. Altogether, mutation of selected residues within 
the general-anaesthetic binding site affects (1) the intrinsic ionic res- 
ponse of GLIC, illustrated by the marked gain of function of I202Y and 
T255A, whose phenotypes are similar to that of the canonical 1233(9’)A 
mutation*”°, and (2) the pharmacology of general anaesthetics, illu- 
strated both by V242M, which displays an increased sensitivity to pro- 
pofol but not to desflurane, and T255A, which has an increased 
sensitivity to propofol but a decreased sensitivity to desflurane. 

These data support the hypothesis that the general-anaesthetic bind- 
ing site described here contributes to general-anaesthetic-mediated 
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Figure 2 | Residues of the binding site. Sites for propofol (right) and desflurane 
(left), viewed from the membrane (top panels with M4 helix removed), and from 
the ECD domain (lower panels with ECD removed). Residues bordering the 
pocket and contributing to binding are depicted as blue or red (mutated positions) 
sticks. SigmaA weighted Fourier difference maps 2F, — F. contoured at 1.50 
around the anaesthetics molecules are represented as a blue mesh. 


inhibition of GLIC. For desflurane, mutagenesis data match the char- 
acteristics of its binding site in the X-ray structure well, with no sig- 
nificant effect when the relatively distant positions 202 and 242 are 
mutated, and a strong impairing effect when mutating T255, which 
extensively contacts desflurane, into an alanine. In contrast, for pro- 
pofol, both positions 202 and 255 contact propofol, but only mutation 
at position 255 alters its effect. More surprisingly, position 242 is not in 
direct contact with propofol but V242M modifies its response. These 
data suggest a significant mobility of propofol within the cavity, a 
feature that may be reflected by the high B factors of general anaes- 
thetics in the crystal structure (Baesflurane = 121 A, Byropofol = 135 A, 
mean values), although high B factors and partial occupancy of the site 
cannot be discriminated at 3.3-A resolution. 

To examine this possibility further, we performed 30-ns molecular 
dynamics simulations of propofol bound to the WT protein, T255A, 
V242M and 1202A mutants. At this timescale, propofol remains in the 
cavity, but shows substantial mobility (Fig. 4a). T255A and V242M are 
associated with (1) reduced propofol fluctuation (root mean square 
fluctuation of propofol non-H atoms of 3+1.1A, 2.4+0.8A, 
2.3+0.8A, 2.7A for the WT, T255A, V242M and 1202A runs, 
respectively), (2) deeper penetration inside the cavity (Fig. 4b) and 
(3) more frequent interaction with residue 242 compared with the 
WT and 1202A (data not shown). Altogether, these simulations pro- 
vide complementary interpretations to account for the higher sensiti- 
vity of T255A and V242M to propofol inhibition that could not have 
been deduced from the static structure alone. 

The X-ray structure of GLIC was formerly interpreted in terms of an 
apparently open conformation*”. But general anaesthetics behave as 
inhibitors of the ionic response and are therefore expected to stabilize 
a closed conformation. Our data unravel a general-anaesthetic site in 
the open conformation, and molecular dynamics simulations show that 
propofol and desflurane are stable in this site conformation at the 30-ns 
timescale. This apparent contradiction can be readily explained by a 
non-exclusive (differential) binding of general anaesthetics to the open 
and closed states, with general anaesthetics displaying a higher affinity 
for the closed state than for the open one"". Interestingly, the T255A and 
1202Y gain-of-function phenotypes suggest a structural rearrangement 
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Figure 3 | Electrophysiological characterization of binding-site residues. 

a, Traces of currents evoked by 30-s applications of low extracellular pH 
separated by 30-60 wash. b, Corresponding plots for currents normalized 
with respect to the value at pH 4. Mean + s.d. of 4 to 12 cells per construct. 
c, Inhibition by 0.5 mM desflurane (left) or 10 1M propofol (right) applied for 
60s during the plateau of GLIC activation by a pH near EC29 (ECj9-30). 

d, Corresponding concentration-inhibition characteristics of desflurane (left) 
or propofol (right). e, Current traces showing the effect of 10 1M propofol on 
GLIC currents corresponding to proton EC3 53,59 (WT, left traces) or ECy 432,57 
(T255A, right traces) of each cell. Xenopus laevis oocytes, holding potential —60 
to —40 mV. 


of the general-anaesthetic binding site during gating, in line with its 
location on the backside of the pore-lining M2 helices, at the level of the 
gate, and close to the transmembrane-domain-extracellular-domain 
interface. We note that the hypothetical gating mechanism* previously 
suggested from the comparison of GLIC and ELIC” structures involves 


Figure 4 | Molecular dynamics simulation of propofol bound to GLIC. 

a, View from the ECD domain depicting propofol positions as green sticks, 
superposed at 0.5-ns intervals onto the starting (cartoon and molecular surface) 
and final (transparent cartoon) conformation of the protein, during the 30-ns 
molecular dynamics runs. The scheme on the left explains the colour code, and 
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a strong reorganization of the general-anaesthetic binding site as a 
consequence of M2 and M3 helix tilting. 

Another important feature of the GLIC-general anaesthetic structures 
is that binding occurs to a site where nearby ordered lipids are identified. 
One lipid lying in the crevice between M1 and M4 is observed in the apo 
and desflurane structures and is displaced in the propofol structure. 
These lipids co-purify with the solubilized protein, indicating tight bind- 
ing within protein subsites known to be critical for the transmembrane 
domain structure’*™. It is known that lipids contribute to pLGIC func- 
tion’*"®, and those observed in the present electron-density maps are 
good candidates for such a role. A perturbation of interactions with lipids 
caused by general-anaesthetic binding might thus contribute to the func- 
tional inhibition, suggesting that general anaesthetics may compete with 
endogenous allosteric modulators”, lipids in this case, but also possibly 
fatty acids, cholesterol and/or neurosteroids'* in the case of eukaryotic 
pLGICs. 

It is striking that the pharmacology of GLIC inhibition resembles 
that of nicotinic acetylcholine receptors (nAChRs), which are also 
inhibited by general anaesthetics and are unusually sensitive to volatile 
anaesthetics’’. The general-anaesthetic binding site described here is 
thus a primary candidate for promoting nAChR inhibition. In contrast, 
Gly/GABAg receptors are mostly potentiated by general anaesthetics. 
Experimental data involving chimaeric constructs show that GlyRal 
S267 (M2), A288 (M3) and 1229 (M1) contribute to general anaesthetic 
and alcohols potentiation’*’. The general anaesthetic etomidate 
labelled brain GABA, receptors at residues «1M236 and 03M286"', 
the latter corresponding to GlyRal A288. This labelling was only partly 
inhibited by propofol and neurosteroids”, consistent with an allosteric 
interaction between several binding sites. Overall, the interpretation 
of these data using homology models based on the cryoelectron- 
microscopy nAChR structure at medium resolution’**° suggests that 
intra-subunit and/or inter-subunit sites in the upper part of the trans- 
membrane domain mediate general-anaesthetic modulation of anionic 
pLGICs. 

The sequence of GLIC can be readily aligned with that of GABA,R 
and GlyR (Supplementary Fig. 3). From the GLIC three-dimensional 
structure and this alignment, the three residues identified in GABA,/ 
Gly receptors can be seen to point towards the inter-subunit cavity 
(Fig. 1c) close to the intra-subunit general-anaesthetic binding site 
described here. A marked reorganization of these cavities was observed 
in the initial steps of channel closing during our 1-p1s molecular 
dynamics simulation of GLIC at neutral pH** (Supplementary Fig. 
4). This involves transient communications between the inter- and 
intra-subunit cavities caused by M2 and M3 motions, which further 
supports the notion that the shape and volume of the cavity are coupled 
to channel gating, and could suggest that a cross-talk between both 
cavities might underlie general-anaesthetic-mediated potentiation. 

pLGIC and particularly nAChRs are not only the target of general 
anaesthetics” but also of natural and synthetic allosteric modulators 
that are developed for their therapeutic potential”*”’. Mutational data 
suggest that ivermectine”’ and PNU- 120596, which behave as positive 
| Wild type | T255A 


V242M | 1202A 


16 22 28 32 
Distance to pore centre (A) 


the three panels correspond to the WT, T255A and V242M systems. b, Distance 
distribution between the propofol centre of mass and the pore centre. The 
distribution is shifted closer to the pore centre for the mutants showing an 
increased inhibition. 
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allosteric modulators of «7 nAChR, bind at a location resembling that 
observed in our structures. The general-anaesthetic binding site unravelled 
herein thus provides a novel template for the design of allosteric modu- 
lators inhibiting or potentiating pLGICs. 


METHODS SUMMARY 


GLIC production’, electrophysiology’ and molecular dynamics” were performed 
as described (full methods in the Supplementary Information). Crystals were 
typically grown in 12-16% PEG 4000, 400 mM NaSCN, 100mM Na-Acetate at 
pH 4, in the presence of an excess of general anaesthetics. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Protein production. The protein was produced and purified as described previ- 
ously**”° with a few variations. The GLIC protein was overexpressed in 
Escherichia coli C43 cells, and expression was induced with 0.1 mM isopropyl 
B-p-1-thiogalactopyranoside (IPTG) at absorbance (Ago nm) = 1 overnight at 
20°C. Cells were mechanically lysed in buffer 1 (Tris 20mM pH 7.6, NaCl 
300 mM, with proteases inhibitors from Roche); membranes were isolated by 
ultracentrifugation. The proteins were extracted from membranes with 2% 
DDM (Anatrace) under agitation at 4 °C, and the solubilized fraction was cleared 
by ultracentrifugation. Solubilized proteins were first purified by affinity chro- 
matography on an amylose resin. After extensive wash, in buffer 1 supplemented 
with 0.1% DDM, the MBP-GLIC fusion protein bound to the resin was cleaved 
overnight at 4 °C under gentle agitation, in the presence of three units of thrombin 
(Calbiochem) per 50 jig of protein and of 2 mM of CaCl. The digested protein was 
eluted in buffer 1 supplemented in 0.02% DDM and concentrated. It was then 
subjected to size exclusion chromatography on a Superose 6 10/300 GL column 
(GE Healthcare) equilibrated in the same buffer. Fractions of the peak correspond- 
ing to the pentamer were pooled and concentrated for crystallization experiments. 
Crystallography. GLIC was crystallized using the vapour diffusion method in hang- 
ing drops at 20 °C. The concentrated (8-12 mg ml *) protein was mixed ina 1:1 ratio 
with reservoir solution containing typically 12-16% PEG 4000, 400 mM NaSCN and 
0.1 M NaAcat pH 4.0. Crystals of the protein grew overnight. A saturating amount of 
desflurane was then added in the well (typically the well of a Linbro plate was 
completely filled). After 2-5 days of equilibration the crystals were rapidly transferred 
in the mother solution supplemented with 20% glycerol for cryoprotection and flash- 
frozen in liquid nitrogen. Pure propofol was added at the time of crystallization right 
after the mixture of the protein solution with the mother liquor. An emulsion was 
formed and small crystals grew in 2 days, nearby the propofol droplets included in the 
crystallization drops. Crystals were flash-frozen in the usual manner. 

A great number of data sets of frozen single crystals were collected on beamlines 
Proxima-I of the Soleil Synchrotron and ID 14 of the European Synchrotron Radiation 
Facility and processed with XDS* and CCP4 (ref. 32) programs. Crystals were iso- 
morphousto the WT ones, with a C2 space group and one pentamer in the asymmetric 
unit. Non-crystallographic symmetry (NCS) restraints were used throughout the 
refinement performed by alternate cycles of manual building in COOT® and auto- 
matic refinement using REFMAC* and BUSTER”. Difference Fourier F, — F- 
maps were checked for strong signals indicating the presence of a ligand. In the 
two data sets presented here, bound anaesthetics corresponded to 5-90 peaks 
(Supplementary Fig. 1), depending on subunits. The peaks were present in each 
subunit in a region devoid of any density in other data sets. Moreover, the electron 
density for the known bound detergents in the pore* appears below or at this level 
(data not shown), namely 6o for the most ordered part of the detergent, which 
constitutes an intrinsic positive control for the presence of anaesthetics. At 3.2- to 
3.3-A resolution it may not be justified to model the bound molecules individually 
as this will result in a different orientation in each subunit. For propofol we used 
fivefold NCS-averaged maps to build the molecules. For desflurane we also used 
NCS maps; in addition, both plausible orientations were tried and the one with the 
trifluoromethyl group at the bottom of the cavity was selected by comparing dif- 
ference maps after refinement and independent docking scores (data not shown). 

Lipids surrounding the anaesthetics binding site were partly modelled, in NCS- 
averaged maps. Identification of the chemical nature of the lipids is not possible 
with such maps at this resolution and thus we arbitrarily used phosphatidyl- 
choline. As in the apo- protein model, the acyl chains are more ordered than 
the polar heads. One small part of an acyl chain corresponding to a second layer 
of lipid with no direct interaction with the protein is present. Structural analysis 
and figure preparation were done with PyMOL**, VMD” and Molprobity™. 
Refinement parameters for ligands were generated with the PRODRG server”. 
Electrophysiology. Conditions for electrophysiological experiments were as fol- 
lows (apart from a few variations® in experiments using desflurane). 

Cell injection. Defolliculated, stage VI" X. laevis oocytes were obtained from a 
commercial supplier one day after ovary dissection. DNA (<2 ng) was blind injected 
into the nucleus through the animal pole, using a pneumatic microinjector, as a 
mixture in water of GLIC cDNA in a pmt3 vector (0.08 g]') and green fluorescent 
protein (GFP) cDNA in the same vector (0.02 g]~ asa reporter gene for successful 
intranuclear injection. Identified cells were kept in 96-well plates with U-shaped 
bottom, in a HEPES-buffered modified Barth’s” solution (in mM: NaCl 88, KCI 1, 
NaHCO, 2.4, HEPES 20, MgSO, 0.82, Ca(NOs)> 0.33, CaCl, 0.41; pH 7.4; 0.22 um 
filtered), at 18 °C for two days and then at 15°C. Oocytes with the T255A mutant 
were transferred to pH 8 Barth’s solution 2 days after injection. GFP-positive oocytes 
selected 2 days after injection were recorded 2-6 days after injection. 

Electrophysiological recordings. Oocytes” were superfused with the animal pole 
facing a gravity driven solution inflow (4-8 ml min”) ina corridor-shaped recording 
chamber (flow section <10 mm/?). A solution of (in mM) NaCl 100, KC13, CaCl 1, 


MgCl, 1 and 2-morpholino-ethanesulphonic acid (MES) 10 was adjusted to pH 8.0 
with NaOH, and used as intertest on T255A oocytes. The control extracellular pH 
(7.3) and lower pH values were reached by adding HCl 2 mol] | and any extra pH 
8.0 solution. The whole oocyte plasma membrane was voltage clamped 
(GeneClamp 500, Axon Instruments/Molecular Devices) using two intracellular 
pipettes filled with 3 M KCl (0.8-1.5 MQ), and distinct current and voltage extra- 
cellular electrodes separately bridged to the bath near suction using 5 gl’ agar in 
3M KCL. Currents were recorded (pClamp8, Axon Instruments) at air-conditioned 
room temperature (21-23 C), acquired at 500 Hz after low-pass filtering (200 Hz), 
and further filtered using 100- to 1-mean sample data reduction for figure display. 
Proton concentration-response curves were established at a holding potential of 
—50mV, using manually controlled 30-s test-pH applications (or shorter when 
desensitization/inhibition at low pH produced a peak current) separated by 30- to 
60-s wash at pH 7.3, or 8.0 for the T255A GLIC mutant. 

Preparation of general-anaesthetic solutions. Desflurane was obtained from Baxter 
Healthcare Corporation. Desflurane solutions were made up gravimetrically, in gas- 
tight ground-glass syringes, and vigorously agitated. The final concentration was 
spot-checked by headspace gas chromatography. Propofol (2,6 diisopropylphenol) 
was obtained from Aldrich (W50,510-2). Propofol stock solution was made by 
dissolving pure oily liquid propofol at 1 moll~’ in dimethyl sulphoxide (DMSO). 
It was kept in glass in the dark at room temperature for up to 1 month, and diluted in 
the recording solution less than 2 h before use on each cell, under strong agitation to 
0.1mM and then to lower concentrations. For most of the recordings, a single 
concentration of propofol was present in the perfusion system, and each cell received 
a single application of propofol. The perfusion system was extensively cleaned and 
partly replaced before going from high to low propofol concentrations. 

Fit of data and statistics. Data in Fig. 3b-e are presented as mean + s.d. Plots 
shown in Fig. 3b, d of log(concentration/mol |” ') versus mean effect were fitted with 
a sigmoid function. Parameters given in Supplementary Table 2, and in Sup- 
plementary Table 3 for desflurane, were obtained by fitting for each cell a plot of 
concentration/mol |” ' versus normalized effect over four or five orders of magnitude 
with the Hill equation, giving values of ny and half-maximum effective concentration 
(ECs) (proton) or ICs9 (desflurane) for individual cells, of which mean = s.d. values 
are shown. Parameters given in Supplementary Table 3 for propofol were obtained by 
Hill-fitting a scatter plot of individual concentration/moll~! versus percentage 
inhibition data points obtained from all the cells tested (one data point per cell in 
most cases); yy and ICso are given with the standard error of the parameters deter- 
mined from the nonlinear regression. Data in Supplementary Fig. 2 were empirically 
fitted with a straight line, or a simple exponential decay, to improve readability. 
Molecular dynamics. We used a full atomic model of GLIC at pH 4.6 derived from 
previous simulations” to add general-anaesthetic molecules at the positions deter- 
mined in the crystal structures presented in this work. The protonation state was 
assigned similar to previous simulations on the basis of pK, calculations with the 
Yasara software** to represent the most probable pattern at pH 4.6, with residues 
E26, E35, E67, E75, E82, D86, D88, E177 and E243 being protonated. H277 was 
doubly protonated. The model was inserted in a fully hydrated palmitoyl-2-oleoyl- 
sn-glycerol-phosphatidylcholine (POPC) lipid bilayer (307 lipids, approximately 
44,000 water molecules) leading to an initial system size of 128 Ax 125A X 182A. 
The net charge of the system was neutralized with 54 Na+ and 89 Cl counterions, 
achieving a salt concentration of about 100 mM. These steps were performed within 
VMD”, using the psfgen, membrane, solvate and autoionize plug-ins. Similar models 
were derived for the 1202A, V242M and T255A mutants. The simulations were 
performed with NAMD“ using the CHARMM27 (ref. 45) force field. Parameters 
for propofol were provided by G. Brannigan’. 

A short equilibration was performed by minimizing the system for 1,000 steps. This 
was followed by 30 ns of production runs. Simulations were performed at 310 K using 
Langevin dynamics with a damping coefficient y of 1 ps’ for temperature control. A 
Langevin piston algorithm was used to maintain the pressure at 1 atm. A short 10-A 
cutoff was used for non-bonded interactions. A smooth switching function was used 
for van der Waals interactions between 8.5 and 10 A. Long-range electrostatic inter- 
actions were treated using the particle mesh Ewald method”. The r-RESPA multiple 
time step method” was used with a 2-fs time step for bonded and for short-range non- 
bonded interactions, and a 4-fs step for long-range electrostatic forces. All bonds 
between hydrogen atoms and heavy atoms were constrained with the SHAKE algo- 
rithm. All molecular dynamics simulations were performed on Vargas, an IBM 
Regatta Power6 machine at the Institut du Développement et des Ressources en 
Informatique Scientifique (IDRIS) Supercomputer Center in Orsay (France). 
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CORRECTIONS & AMENDMENTS 


ERRATUM 
doi:10.1038/nature09734 


Tumour vascularization via 
endothelial differentiation of 
glioblastoma stem-like cells 


Lucia Ricci-Vitiani, Roberto Pallini, Mauro Biffoni, 

Matilde Todaro, Gloria Invernici, Tonia Cenci, Giulio Maira, 
Eugenio Agostino Parati, Giorgio Stassi, Luigi Maria Larocca 
& Ruggero De Maria 


Nature 468, 824-828 (2010) 


In Fig. 3b of this Letter, the black bar was inadvertently labelled as 
mCD31 instead of hCD31 (human CD31). The corrected Fig. 3b is 
shown below. 
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Researching 
outside the box 


Open innovation offers scientists novel ways to apply their 
expertise — and sometimes provides much-needed cash. 


BY CRISTINA JIMENEZ 


couple of months from finishing a mas- 
ter’s degree in molecular genetics, and was 
thinking about what to do next. During a cof- 
fee break, he read a news article about a com- 
pany that posted other companies’ unresolved 


I: March 2008, Ahmet Karabulut was a 


scientific and business problems online. Any- 
one could send in a solution. Intrigued, Kara- 
bulut sought out the website and applied. He 
ended up successfully inventing a solution to 
an organic-chemistry problem on the stability 
of a compound in a common nasal decongest- 
ant. Karabulut used little more than his prior 
knowledge of the topic and access to the 


scientific literature. His immediate reward for a 
couple of days of work was US$20,000, crucial 
support as he was between jobs. But Karabulut 
says that the experience did more than earn him 
money — it also enriched his career. 

He had answered the call of InnoCentive, an 
organization based in Waltham, Massachusetts, 
that uses the Internet to link ‘seekers’ — client 
companies struggling with pressing scientific 
or business problems — with ‘solvers’ — more 
than 250,000 problem-cracking minds around 
the globe, InnoCentive claims. The company 
is one of several, most of which are based in 
the United States, that are engaged in ‘open 
innovation. The motivating principle behind 
open innovation is that companies and other 
institutions should take full advantage of widely 
distributed knowledge in a wired world, finding 
products, patents and solutions — scientific, 
technological or social — outside the confines 
of their own organizations. 

The trend has its own lingo: InnoCentive and 
others, including IdeaConnection of Vancouver, 
Canada, NineSigma of Cleveland, Ohio, Top- 
Coder of Glastonbury, Connecticut, and yet2. 
com of Needham, Massachusetts, are known as 
innovation intermediaries, facilitators or tech- 
nology brokers. The particulars of the organiza- 
tions vary, but all pursue solutions to pressing 
research problems by posting challenges online, 
with the promise of financial awards — essen- 
tially crowdsourcing. Seekers range from drug 
to oil companies, and government agencies to 
non-profit organizations, and typically offer sev- 
eral thousand dollars for a project with a short 
turnaround. For scientists in need ofa short- 
term financial boost or a supplemental source 
of income, open-innovation opportunities can 
offer a novel and challenging way to tackle new 
topics and add credentials to their CVs. 


FRINGE BENEFITS 

Since his first foray into open innovation, Karab- 
ulut has solved two further challenges, including 
developing a novel method for membrane-pro- 
tein expression. He now works in drug discovery 
at the Fred Hutchinson Cancer Research Center 
in Seattle, Washington. “I was glad that I put the 
awards in my CV, he says, noting that he started 
to get many more responses to job applications 
once he had done so. Karabulut believes that the 
challenges showed that he could troubleshoot 
and think creatively, skills that many employ- 
ers value. Henry Chesbrough, executive director 
of the Center for Open Innovation at the Uni- 
versity of California, Berkeley, agrees. “Even 
those who do not ‘wir’ often develop their > 
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reputation as an expert, making them more 
attractive within their innovation community 
for hire as a consultant or an employee for some 
future activity, he says. Chesbrough coined and 
promoted the term ‘open innovation in his book 
Open Innovation: The New Imperative for Cre- 
ating and Profiting from Technology (Harvard 
Business School Press; 2003). 

Scientists interested in becoming solvers can 
typically sign up for free on the intermediary's 
website. Usually facilitators don’t require poten- 
tial solvers to have formal qualifications, but 
InnoCentive estimates that 61% of their solvers 
hold an advanced degree. After signing in, can- 
didate solvers can browse through numerous 
challenges in categories such as global health, 
clean technology or aerospace. InnoCentive, 
for example, runs about 40 challenges at a time. 
Solvers tend to choose to work on challenges 
that they think they can solve, says Lars Bo 
Jeppesen, an associate professor at Copenha- 
gen Business School who has been studying the 
nature of innovation in a world of distributed 
knowledge for the past 10 years. This natural 
self-selecting method allows “the right people 
to solve the right problem at the right time’, says 
Dwayne Spradlin, chief executive of InnoCen- 
tive. One major consumer goods product com- 
pany claims that it has awarded prizes for more 
than half ofits posted challenges. As is common 
practice, neither InnoCentive nor the company 
would provide specifics, fearing revealing too 
much to competitors. 

So what does it take to become a successful 
solver? Some solvers emphasize the importance 
of a diverse research and training background 
and the ability to apply solutions and tools 
between fields or in a new one. “In fields that I 
haven't mastered, I can think outside of the box; 
Iam not limited by the known and unknowns or 
the rules of the field” says Mounir Errami, a bio- 
chemist and bioinformatician at the University 
of Texas Southwestern Medical Center in Dal- 
las. He has solved three challenges at InnoCen- 
tive, developing technologies for medicine and 
cosmetic chemistry unrelated to bioinformat- 
ics. Yury Bodrov, an organic chemist who has 
solved multiple challenges at InnoCentive and 
IdeaConnection, identifies two major assets for 
solvers. One is an analytical mind that can dis- 
tinguish important data from noise. “The sec- 
ond is not a skill” says Bodrov, an independent 
consultant. “It’s imagination.” 


INNOVATION COMPENSATION 

Compensation ranges from $5,000 to $50,000, 
depending on the type of challenge. Sometimes, 
a few days’ work can earn the solver several 
thousand dollars. At other times, however, the 
reward may not be worth the time and effort. 
Errami says that in the case of one challenge he 
took on, the amount of work needed has not 
been in line with the potential compensation. 
For past efforts, he says, he has typically required 
a short time to devise a solution, followed by 
a bit more to sort out technical aspects with a 


collaborator. “If the 
solution takes more 
than a couple of days, 
I avoid competing 
because the odds of 
winning are slim,” he 
says. At InnoCentive, 
ifa solution meets only 
some of the seeker’s 
criteria, a client-serv- 


“Even those 


. ices team judges how 
who don’t ‘win’ close it is. The seeker 
develop their then decides whether 
reputationds an  toissuea fullor partial 
expert,making —_ award, ornoneatall. 
them more But the benefits 
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can boost a solver’s 
confidence in his or her problem-solving 
skills, a benefit for those aspiring to academia 
or entrepreneurship. Simone Sergi, a telecom- 
munications engineer and network system 
administrator at a bank in Reggio Emilia, Italy, 
with a doctorate in digital communications, 
won an InnoCentive challenge on reorganizing 
communications among satellites. He says that 
it gave him the confidence to consider a long- 
time dream of launching a wireless-technologies 
start-up company. Sergi says that he does not 
enjoy his job, and relished the opportunity to 
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Scientists who moonlight as solvers can use 
novel means to explore long-standing prob- 
lems. Chris Wilmer, a PhD student in chemi- 
cal and biological engineering at Northwestern 
University in Evanston, Illinois, solved a chal- 
lenge that involved the lack of access to clean 
water in poor villages of developing coun- 
tries. He found inspiration in the success of a 
mobile-phone business run by the Grameen 
Bank in Dhaka, Bangladesh. The bank would 
lend phones to a local entrepreneur in a poor 
village, who would pay back the loan by charg- 
ing others in the village to use a phone. It was a 
profitable and self-sustaining model. Wilmer 
described how a similarly structured safe-water 
business could yield humanitarian benefits. “I 
enjoy solving social problems, so it was fun,” 
says Wilmer, who has also won a second chal- 
lenge involving programming a software tool to 
help characterize synthetic DNA strands. 

For some, pursuing open innovation is a way 
to achieve career independence and flexibility. 
Grace Kepler, part-time associate research pro- 
fessor at the Center for Research in Scientific 
Computation at North Carolina State Univer- 
sity (NCSU) in Raleigh, sees the challenges as 
well suited to scientists who, like her, are not 
employed full-time and have family responsi- 
bilities — as well as those who need more money 
or are unencumbered by intellectual-property 
issues. She applied her mathematical and statis- 
tical skills to a computational-biology challenge, 
and now also works as a scientific consultant, 
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helping the company that posed the challenge 
to model crops. But scientists employed full- or 
part-time should tread carefully. When Kepler 
first started to work asa solver for InnoCentive, 
she had to obtain a waiver from NCSU stating 
that the university had no claim on the intel- 
lectual property in her solution, and that she 
had not designed it in the context of her work at 
the university. Seeker companies are careful to 
avoid revealing much to competitors — solvers 
often don't even know which company they’re 
working for. 

Open innovation could also offer scientists in 
lesser-known locations or institutions the power 
to break down barriers. Intermediaries solicit 
ideas from scientists all over the planet and from 
all sorts of backgrounds; an Internet connection 
is typically the only requirement, says Karim 
Lakhani, a professor of technology and opera- 
tions management at Harvard Business School 
in Boston, Massachusetts. He calls solvers the 
“unusual suspects” — scientists and others with 
a potentially unconventional perspective. 


NEW FUNDING MODEL 
It is not just individuals who stand to benefit 
from an open-innovation funding stream; uni- 
versities seeking a bit of extra money could also 
reap rewards. The National Physical Labora- 
tory (NPL) in Teddington, UK, discovered 
open innovation 
while investigating 
alternative funding 
paths. “For the past 
three years, we've 
been trying to reduce 
our reliance on tradi- 
tional government- 
funded routes for 
collaborative R&D, 
says Matt Smith, 


business develop- “Companies 
ment manager at the need to adap t 
NPL, adding thatthe towork with 
laboratoryhasearned ©X ternalideas 
more than £300,000 andinventors.” 
(US$467,000) Wim Vanhaverbeke 
through NineSigma, 


which focuses on pairing organizations — 
whether inventors, start-ups or universities 
— that possess innovative technology with com- 
panies that can commercialize those inventions. 
And when the National Aerospace Laboratory 
of the Netherlands (NRL) in Amsterdam was 
undergoing budget cuts, it secured a lucrative 
industrial research project through NineSigma 
to develop a life-assessment model for gas tur- 
bines. “The project lasted for 15 months and 
was worth several hundred thousand euros,” 
says Arjen Vollebregt, a department manager 
at the NRL. He thinks that companies such as 
NineSigma could help labs to get extra funding 
with relatively little effort. 

Even a government agency, steeped in 
bureaucracy and decades of tradition, may have 
something to gain from open innovation — and 
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researchers eager to work with that agency 
could benefit. Bodrov solved a challenge for 
NASA on keeping food fresh in space, and 
he is now an independent scientific consult- 
ant for the chemical industry. He is also an 
entrepreneur: the approximately $160,000 
that Bodrov earned from solving 16 chal- 
lenges at InnoCentive and IdeaConnection 
has helped him to launch a start-up that 
develops nanomaterials for drug delivery in 
Saint Petersburg, Russia. 

Jeffrey Davis, director of NASA's Space 
Life Sciences Programme at the Johnson 
Space Center in Houston, Texas, says that 
the agency liked Bodrov’s food-packaging 
idea because it used a flexible graphite mate- 
rial — a solution perhaps familiar to materi- 
als scientists, but one that the food industry 
would not have generated. NASA is now 
looking at creating its own problem-solving 
framework, blending open-innovation chal- 
lenges with traditional grants, contracts and 
small-business proposals, says Davis. 

Davis says that accessing open-inno- 
vation channels was easy. “But there was 
a psychological barrier to admitting we 
couldn't find the answers ourselves,” he 
says. This is not uncommon. Hesitation 
to accept outside inventions — a ‘not- 
invented-here’ stigma — is one of the major 
obstacles for open-innovation mecha- 
nisms, says Wim Vanhaverbeke, a profes- 
sor of business studies at Hasselt University 
in Diepenbeek, Belgium. Companies must 
confront the same barrier. “They need to 
forget the idea that their only mission is 
to protect inside inventions and adapt to 
work with external ideas and inventors,’ 
says Vanhaverbeke. 

The open-innovation approach contin- 
ues to evolve. IdeaConnection, formed in 
2007, is attempting to form teams of solvers 
using the extensive information collected 
in their online applications; each team 
member receives equal compensation if 
they win the challenge. Although teams 
composed of members with complemen- 
tary backgrounds might have a better 
chance of solving challenges, efficiently 
communicating ideas between disparate 
members can bea challenge — something 
team facilitators attempt to address. Inno- 
Centive, meanwhile, is creating a sort of 
‘dating site’ for scientists, so that they can 
choose who they want to work with. 

Karabulut says that the open-innovation 
strategy still has plenty of room to grow, “I 
don't know any better way for ‘seekers’ to find 
global talent for very specific challenges,” he 
says. The thrill of winning continues to bea 
big part of the appeal. “It is one thing to win 
acash reward,’ says Errami. “But it is quite a 
feeling to win a challenge? m 


Cristina Jiménez is a freelance writer 
based in Barcelona, Spain. 
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Confessions of a 
procrastinator 


Everyone puts off big tasks with smaller ones, and the 
only solution is to fight fire with fire, says Fabio Paglieri. 


Jerome’s 1889 novel Three Men in a Boat, 

the narrator diagnoses himself with nearly 
every possible ailment after leafing through a 
medical book found in the British Museum. 
Psychology researchers such as myself are 
prone to a special brand of hypochondria: 
like Jerome’s character, I cannot help but 
wonder whether I suffer from some of the 
psychological shortcomings that I observe 
each day in the lab. 

My work studying how people schedule vari- 
ous tasks over time (usually inefficiently) has 
shown me the error of my own organizational 
ways, and now I know the name 
of my terminal illness: procras- 
tination. I am always struggling 
to stick to multiple deadlines 
on the most disparate jobs. For 
every project with a deadline 
that I manage to meet, there 
are two more that I am forced 
to postpone. I am a pathologi- 
cal procrastinator. 

For some time I thought 
I was alone in my depravity, 
and I laboured to keep it hid- 
den from family, friends and co-workers. 
Then it dawned on me: procrastination is no 
exotic malaise, but rather a pandemic virus, 
one possessed of alarming virulence in the 
research community. Colleagues never tire 
of mentioning “bottomless to-do lists’ ‘over- 
whelming commitments, “busy schedules’ 
and ‘pressing deadlines. Such symptoms 
can result in students failing to deliver data, 
a co-author unable to complete a paper ora 
publisher postponing a manuscript’s publica- 
tion. Clearly I am in no position to judge, as 
I myself have committed similar misdeeds. I 
take some heart in sharing the guilt with so 
many others. 

How might young scientists manage to 
avoid wrecking their careers despite such a 
character flaw? Procrastination often stems 
from over-commitment, so simply taking on 
fewer obligations might solve the quandary. 
But this is easier said than done, especially for 
a postdoctoral researcher. One never knows 
which project might turn out to be a means to 


I: a memorable passage from Jerome K. 


new career avenues or to tenure. And by the 
time one realizes that a new task is just another 
time-consuming burden, it is often too late to 
retreat without repercussions. 

I was about to give in to despair and start 
roaming the self-help aisle of my favourite book- 
store in search of a cure when I found a possi- 
ble solution at structuredprocrastination.com. 
On the site, John Perry, a professor of phi- 
losophy at Stanford University in California, 
notes that procrastinators are never really idle; 
instead, they work on something in order to put 
off doing something else. According to Perry, 
you can make procrastination work for you. Just 
convince yourself that there is 
something really complex and 
important that you intend to 
do (say, write a full monograph 
on your favourite research 
topic), and your procrastina- 
tion instinct will immediately 
drive you to do other tasks as a 
way of putting off working on 
your big project. The trick is to 
make sure that these other tasks 
are productive and not a waste 
of time. The bigger your ulti- 
mate aim, the more likely you are to take part in 
useful procrastination chores such as running 
experiments, tutoring students, writing articles 
or going to conferences. 

If Perry is right, you don’t have to conquer 
your base procrastination impulse to progress 
in your professional life. True, a modicum 
of self-deception is required for the strategy 
to work. But fortunately, procrastinators are 
skilled self-deceivers anyway. 

Will it work? It has for me so far. I have man- 
aged to diligently complete many small but 
important tasks as a way of putting off other 
impending obligations. And, unfortunately, 
the alternative is to conquer procrastination 
by sheer willpower, which is something that 
humans just aren't very good at. m 


Fabio Paglieri keeps a Postdoc Journal 
at go.nature.com/3fttcj and is a postdoc 
in cognitive psychology at the Institute for 
Cognitive Science and Technologies of the 
National Research Council in Rome. 
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LAST OF THE GUERRILLA GARDENERS 


BY DAVID L. CLEMENTS 


r | Ahey came for ‘Percy Thrower’ last 
night. I was on my way to deliver 
some Pink Brandywine tomato seeds 

when I saw the first police car. I turned the 
corner and saw a fleet of them parked out- 
side her house, complete with sniffer dogs 
and a space-suited forensic team heading for 
her potting shed. 

I averted my eyes and walked past on the 
opposite side of the road, feeling the enve- 
lope of illegal seeds in my pack broadcast my 
guilt. As I left her road, the sterilization van 
arrived, its flame throwers ready to destroy 
‘Percy’s irreplaceable collection of plants. 

I got away. The others weren't so lucky. 
As I waited for the bus I checked our secure 
server and realized they were rolling up the 
whole network. ‘Monty’ had been the first, 
but in catapulting a package of herb seeds 
into Buckingham Palace gardens hed gone 
too far. His arrest had been the trigger for 
raids across the country. ‘Bob’ had sent out a 
warning as they smashed down his door, but 
they been ready for us all. If] hadn't been on 
a delivery run theyd’ve caught me as well. 

I couldn't go home. Most of the people I 
trusted had been picked up. I stayed on the 
bus as it passed my stop and headed into cen- 
tral London. The clean-up crews were obvi- 
ous, torching collections of wild flowers in 
the roadside beds that Id seeded from bus 
windows while commuting. 

All the hard work, all the beautiful, irre- 
placeable diversity, stamped out by commer- 
cial greed. If !'d’ve had the machinery with 
me I'd’ve leapt off the bus and seeded 
the palace gardens myself. 

‘Percy’ had started 
the whole thing with a 
few prophetic words: 
“Biology is the biggest 
peer-to-peer copy- 
ing system on the 
planet. Now they've 
eliminated file shar- 
ing they'll come for 
the seed sharers.” 

Shed been a univer- 
sity botanist for years but 
left when it became clear 
that all the grants were con- 
trolled by big agribusiness. 
We knew we were in trouble 
when Kew was sold off and 
Henry Doubleday broken up. 
Their vast seed collections became 


~ 


x 


Seeding a revolution. 


the intellectual property of a few huge corpo- 
rations. Unlicensed seeds were already illegal 
to sell, but once companies owned the rare 
strains, they stopped collectors sharing them 
for free. They wanted to control it all. 

At first we tried to stop them. There were 
protests, lobbies and mass marches. Garden- 
er’s Question Time became such a political 
hot potato it was cancelled by the BBC. And 
then came the Chelsea Flower-Show Riots. 

When I got off the bus I saw the police at 
the station. But it was just the usual patrols, 
not yet a manhunt. Maybe word of my escape 
had yet to reach them. I headed for Left 
Luggage. 

We were called economic terrorists, 
threatening profits from high-cost, high- 
yield, terminator-gene strains that would 
feed the world and soak up excess CO,. But 
we just wanted tasty vegetables from our own 
gardens, unusual flowers smelling as good as 
they looked, and the opportunity to eat the 
occasional purple carrot. Serious action only 
came when self-propagating super-plants 
were found growing by a road in Norfolk. 

“Businessmen don't understand that biol- 
ogy is a lot messier than digital copying,” 
‘Percy’ had said as we talked in her potting 
shed. “There’s a dozen perfectly natural ways 
the terminator gene might have failed. One 
cosmic ray taking out the right base pair 
would be enough!” But scientific sense was 
never going to stand up to irate politicians 
shouting “Something must be done!” 
Fines became prison sentences, the 
Seed Squads were established and 
we were forced underground. Home 
gardens were no longer safe, so we 
became guerrilla gardeners — a 
secret society sharing seeds 
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and planting contraband NATURE.COM 
crops in public spaces.  FollowFutures on 
We tended them at night — Facebook at: 

or just scattered seeds far _go.nature.com/mtoodm 


and wide to let nature 
take its course. That’s when the network 
started and we adopted our noms de vert. 

We were too successful. Nature was indeed 
the great copier. Our wilder strains could 
fend for themselves and started to spread. 
The gloves finally came off when the direc- 
tor of Smaxo’s agricultural division found a 
clump of illegal Afghan Purple carrots grow- 
ing at the bottom of his garden and carpeted 
the Prime Minister. Of course, ‘Monty’ and 
his catapult didn't help. 

Now the only guerrilla gardener left 
is me. 

I collected my escape stash from Left Lug- 
gage along with Monty’s seed catapult. The 
wig, hat and glasses helped me slip past the 
tighter police patrols and onto the sleeper to 
Fort William. Locked in my cabin I shaved 
my distinctive beard and used clippers so 
that I'd match the fake ID in my stash. The 
train would go through a lot of isolated coun- 
try. The clean up crews couldn't cover all that 
ground in one season, so some of my seeds 
were going out of the window. 

As for the rest... 

There are islands off the Scottish coast 

contaminated by bioweapons 

testing from the Second 
World War. People are 
forbidden and there 
are no sheep or rab- 
bits. But their climate 
is ideal. The seed 
catapult has enough 
range to reach them 
from a boat offshore. 

Or I could land and 

make certain they're 
properly planted. The 
seeds will do well on the 
islands, even ifI don't. In 

a few years they'll become 

areserve for natural, non- 
commercial diversity no 
matter what happens to me, 
the last guerrilla gardener. m 


David L. Clements is an 
astrophysicist at Imperial 
College London where he 
works on the Herschel and Planck 
projects, and occasionally writes 
science fiction. He doesn't have a garden. 
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Structure of human O-GlcNAc transferase and its 
complex with a peptide substrate 


Michael B. Lazarus’**, Yunsun Nam?"**, Jiaoyang Jiang’, Piotr Sliz”* & Suzanne Walker* 


The essential mammalian enzyme O-linked {-N-acetylglucosa- 
mine transferase (O-GlcNAc transferase, here OGT) couples meta- 
bolic status to the regulation of a wide variety of cellular signalling 
pathways by acting as a nutrient sensor’. OGT catalyses the transfer 
of N-acetylglucosamine from UDP-N-acetylglucosamine (UDP- 
GlcNAc) to serines and threonines of cytoplasmic, nuclear and mito- 
chondrial proteins’’, including numerous transcription factors*, 
tumour suppressors, kinases’, phosphatases’ and histone-modifying 
proteins®. Aberrant glycosylation by OGT has been linked to insulin 
resistance’, diabetic complications®, cancer’ and neurodegenerative 
diseases including Alzheimer’s'®. Despite the importance of OGT, 
the details of how it recognizes and glycosylates its protein sub- 
strates are largely unknown. We report here two crystal structures of 
human OGT, as a binary complex with UDP (2.8 A resolution) and 
asa ternary complex with UDP and a peptide substrate (1.95 A). The 
structures provide clues to the enzyme mechanism, show how OGT 
recognizes target peptide sequences, and reveal the fold of the unique 
domain between the two halves of the catalytic region. This informa- 
tion will accelerate the rational design of biological experiments 
to investigate OGT’s functions; it will also help the design of 
inhibitors for use as cellular probes and help to assess its potential 
as a therapeutic target. 

The ability to sense and respond to nutrient levels is critical for the 
growth of all living systems. In eukaryotes, a major mechanism for 
nutrient sensing involves the essential'' protein glycosyltransferase 
OGT, which senses cellular glucose levels via UDP-GlcNAc concentra- 
tions, and responds by dynamically O-GlcNAcylating a wide range of 
nuclear and cytoplasmic proteins”"*. These include proteins involved in 
insulin-like signalling pathways’ and transcriptional activators that 
regulate glucose levels by controlling gluconeogenesis’. As many 
known O-GlcNAcylation sites are also phosphorylation sites, OGT is 
proposed to play a major role in modulating cellular kinase signalling 
cascades'*. OGT is also involved in widespread transcriptional regu- 
lation’*"’”. Prolonged hyperglycaemia, such as occurs in diabetes, or 
excessive glucose uptake, such as occurs in cancer cells, results in hyper- 
O-GlcNAcylation of cellular proteins by OGT, and this increased 
O-GlcNAcylation has been linked to harmful cellular effects'*. Thus, 
strategies to modulate OGT activity may have therapeutic value for 
treating diabetic complications, cancer, and other diseases”. 

The lack of a crystal structure has been a major impediment to 
investigating OGT’s molecular mechanisms, understanding substrate 
recognition, and developing inhibitors. OGT comprises two distinct 
regions: an N-terminal region consisting of a series of tetratricopeptide 
repeat (TPR) units'®”° and a multidomain catalytic region. The TPR 
domain is proposed to scaffold interactions with other proteins, which 
may play a role in determining substrate selectivity*’. A crystal struc- 
ture comprising 11.5 TPR units of human OGT has been reported”', 
but there have been no structures of the catalytic region. From 
sequence analysis and structures of bacterial glycosyltransferases***®, 


Figure 1 | Overall structure of human OGT complexed to UDP. 

a, Schematic of OGT domain architecture with the TPR units shown in grey, the 
transitional helix (H3) in purple, the N-Cat domain in blue, the Int-D domain 
in green, and the C-Cat domain in red. The native isoforms of OGT (sOGT, 
short OGT; mOGT, mitochondrial OGT; and ncOGT, nucleocytoplasmic 
OGT) and the crystallization construct differ only in the number of TPRs, as 
shown. b, Overall fold of OGT from the OGT-UDP complex in a ribbon 
representation. The colouring is the same as the schematic in a. The UDP is 
shown in cyan. The N-Cat domain helices unique to OGT are indicated as H1 
and H2. ¢, Surface representation of the OGT-UDP complex. The colouring 
scheme is the same as in a and b. 
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including a bacterial homologue of unknown function®”’, OGT 
was predicted to be a member of the GT-B superfamily of glycosyl- 
transferases (Gtfs)*’. However, OGT is unusual because it is the only 
known member to glycosylate polypeptides and it contains a long 
uncharacterized intervening sequence (~120 amino acids) in the 
middle of the catalytic region. It is also proposed to contain a phos- 
phatidylinositol (3,4,5)-trisphosphate (PIP) binding domain involved 
in membrane recruitment in response to insulin signalling’. 

We report two crystal structures of a human OGT construct 
(hOGT,5) containing 4.5 TPR units and the catalytic domain. The 
catalytic properties of this construct are similar to those of the full- 
length enzyme (Supplementary Fig. 1)°*. One structure (2.8 A, referred 
to as OGT-UDP) is a complex with UDP; the other structure (1.95 A, 
referred to as OGT-UDP-peptide) is a complex containing UDP anda 
well-characterized 14-residue CKII peptide substrate**. On the basis of 
currently available experimental data, we also present a model for the 
full-length enzyme (Supplementary Information). Details of structure 
determination are presented in Methods and Supplementary Tables 1 
and 2. 

The OGT-UDP complex is shown in Fig. 1. The catalytic region 
contains three domains: the amino (N)-terminal domain (N-Cat), the 
carboxy (C)-terminal domain (C-Cat), and the intervening domain 
(Int-D) (Fig. la, b). The N-Cat and C-Cat domains have Rossmann- 
like folds typical of GT-B superfamily members; however, the N-Cat 
domain is distinctive in containing two additional helices, H1 and H2, 


which form an essential part of the active site (Fig. 1b). The Int-D 
domain, which has a novel fold, packs exclusively against the C-Cat 
domain (Fig. 1c). The UDP moiety binds in a pocket in the C-Cat 
domain near the interface with the N-Cat domain’’. This pocket is 
lined with conserved residues shown to be important for catalytic 
activity (Supplementary Table 3)”°”°. A transitional helix (H3) links 
the catalytic region to the TPR repeats, which spiral along the upper 
surface of the catalytic region from the C-Cat domain to the N-Cat 
domain. The TPRs and the catalytic region are demarcated by a narrow 
horizontal cleft. 

The OGT-UDP-peptide complex (Fig. 2), which crystallized in a 
different space group from the OGT-UDP complex, has a wider cleft 
between the TPR domain and the catalytic region than the OGT-UDP 
complex (Fig. 1c and Fig. 2a), and the CKII peptide binds in this cleft. 
This peptide, YPGGSTPVS*SANMM, contains three serines and a 
threonine, but only one serine (underlined; referred to as Ser*) is 
glycosylated by OGT”*. The hydroxyl of Ser* points into the nucleotide- 
sugar binding site (Fig. 2b). The two residues N-terminal to Ser* lie over 
the UDP moiety; the residues C-terminal to Ser* traverse towards the 
back of the cleft along the H2 helix of the N-Cat lobe. Although OGT 
glycosylates a wide range of target peptides, it prefers sequences in 
which the residues flanking the glycosylated amino acid enforce an 
extended conformation (for example, prolines and B-branched amino 
acids; see Supplementary Fig. 2 and Supplementary Table 4). Consistent 
with these preferences, the peptide is anchored mainly by contacts from 
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Figure 2 | Structure of the OGT-UDP-peptide complex. a, Surface 
rendering of the OGT complex with UDP and the CKII peptide substrate”. The 
view and the colouring is the same as in Fig. 1. The peptide, shown in yellow, lies 
over the UDP moiety, which is not visible in this orientation. b, Close-up 
surface rendering of the OGT active site (grey) containing the CKII peptide ina 
stick representation (carbon atoms shown in yellow) with the UDP (purple) ina 
space filling representation lying directly underneath it. The reactive serine is 
indicated by an asterisk. The peptide binds in the cleft between the TPR region 
and the catalytic region, and extends along the interface between the C-Cat and 
N-Cat domains. Protein residues implicated in catalytic activity are coloured 
orange, green, or blue in decreasing order of importance based on residual 
activity after mutation (Supplementary Table 3). Lysine 842 (orange) lies 
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underneath UDP in this view. c, View of UDP (carbon atoms shown in cyan) 
and part of the CKII peptide (carbon atoms shown in yellow) with selected 
OGT side chains shown. Dashed lines indicate inferred hydrogen bonds based 
on distances in the OGT-UDP-peptide complex. The 2F, — F. omit map is 
contoured at the 1a level. d, Proposed mechanism of OGT. The ordered 
sequential bi-bi kinetic mechanism shown is based on the structure of the 
ternary complex and supporting kinetic experiments (Supplementary Fig. 4). 
The peptide is depicted in yellow with only the reactive serine hydroxyl shown. 
H498 is the proposed catalytic base. Lys 842, also shown to be essential for 
activity”*”°, stabilizes the UDP moiety. e, Histogram showing the relative 
activities of the H498A and H558A mutants compared to the wild-type (WT) 
protein (average + s.d., n = 3). 
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OGT side chains to the amide backbone, with an additional contact 
from the UDP moiety to the backbone amide of Ser*. The cleft is also 
filled with ordered water molecules, enabling it to serve as an adaptable 
interface to bind a range of polypeptides containing side chains of 
different sizes, polarity, and hydrogen bonding capabilities. As the pep- 
tide substrate is anchored by contacts to its backbone, it is reasonable to 
infer that protein substrates are glycosylated on flexible regions such as 
loops or termini that can bind in an extended conformation, exposing 
the amide backbone. 

The closed conformation of the substrate-binding cleft in the OGT- 
UDP structure is stabilized by a ‘latch’ comprising contacts between 
TPRs 10/11 and the H2 helix of the catalytic domain (Fig. 2a and 
Supplementary Fig. 3). Opening of the cleft in the OGT-UDP-peptide 
complex occurs owing to a hinge-like motion around a pivot point 
between TPRs 12 and 13. The two structures suggest that glycosylation 
substrates enter the active site from the face of the enzyme shown in 
Fig. 2a, with the TPR domain restricting or allowing access, depending 
on its conformation and its interactions with the catalytic domain. 
Molecular dynamics simulations indicate that the ‘hinge’ between 
the catalytic domain and the TPR domain is capable of large motions 
that fully expose the active site, which would allow protein substrates 
to approach closely enough for surface loops to enter (Supplementary 
Movie 1). The molecular mechanisms that facilitate or stabilize open- 
ing of the cleft to allow access of protein substrates remain to be 
determined, but may involve interactions between protein substrates 
or adaptor proteins and the other regions of OGT. 

The OGT-UDP-peptide complex, in addition to revealing how 
peptide substrates bind, provides unexpected insights into the kinetic 
mechanism. OGT was previously proposed to have a random sequen- 
tial ‘bi-b? mechanism in which either substrate can bind first*®. The 
structure, however, indicates that the peptide substrate binds over the 
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nucleotide-sugar binding pocket, blocking access to it. Moreover, the 
a-phosphate of the UDP moiety contacts the backbone amide of 
Ser* (Fig. 2c), which helps orient the peptide. The peptide complex 
suggests an ordered mechanism in which UDP-GIcNAc binds before 
the polypeptide substrate. To assess the order of substrate binding, we 
analysed the product inhibition patterns for UDP. At saturating 
peptide concentrations, a competitive inhibition pattern was obtained 
for UDP with respect to UDP-GlcNAc, which is inconsistent with a 
random mechanism, but supports the ordered sequential bi-bi mech- 
anism implied by the crystal structure (Supplementary Fig. 4). 

Another insight from the crystal structure is the identity of the cata- 
lytic base. On the basis of analyses of other GT-B family members, 
including the bacterial OGT homologue, it was proposed that His 558 
is the catalytic base. Although we have verified that this residue is critical 
for catalytic activity, the peptide complex shows that it is more than 5 A 
away from the reactive serine hydroxyl and makes an apparent hydrogen 
bond with the backbone carbonyl of the preceding residue. In contrast, 
His 498, which is invariant in metazoan OGTs but absent in the homo- 
logous bacterial enzyme, protrudes from helix H1 into the active site 
within 3.5 A of the Ser* hydroxyl. As His 498 is critical for activity and is 
located between the reactive serine hydroxyl and the GlcNAc binding 
pocket, it is the probable catalytic base in OGT. 

We were unable to obtain a crystal of the OGT-UDP-GlcNAc complex 
owing to hydrolysis of the substrate, but according to the computational 
docking experiments we performed, the GlcNAc is oriented in a manner 
that exposes its B-face to the overlying peptide (Supplementary Fig. 5) and 
places the anomeric carbon near the reactive serine. This conformation is 
similar to the UDP-GlcNAc conformation observed in a complex of 
another GT-B family member”, and its relevance is supported by evid- 
ence that the C2 N-acetyl moiety projects up from the OGT sugar binding 
pocket”. Furthermore, it is consistent with the enzymatic reaction, which 


Figure 3 | Structure of the intervening domain and full-length models of 
human OGT. a, Ribbon representation of the intervening domain rendered in 
light green with missing loops represented by dotted lines. Lysine side chains 
that form an extensive positive surface (see b) are displayed in a ‘ball-and-stick’ 
representation. Shown in coral is a helix from the C-cat domain containing four 
basic residues that contribute to the positively charged surface’. b, Surface 
representation of OGT coloured according to electrostatic potential, with blue 
representing areas of positive charge and red representing areas of negative 
charge. The protein is rotated 90° around the x-axis from the representation 
shown in Figs 1, 2 and 3c, exposing the bottom surface of the catalytic region. 


c, Model of full-length human OGT, shown asa surface rendering and coloured 
as in Fig. 1a, based on the hOGT 5 structures and the previously reported TPR 
domain structure. The TPRs preceding the boundary of hOGT,; are shown in 
light grey. The model is shown as a monomer, but OGT may exist in different 
oligomerization states in cells’'”*. Hinge and latch regions are indicated by 
arrows. d, Model of full-length OGT opening to accommodate larger 
substrates. The ‘open’ conformation is based on molecular dynamics 
simulations (Supplementary Movie 1), as described in Methods. (Atomic 
coordinates for full-length models are available for download; see 
Supplementary Information.) 
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involves displacement of the o-UDP group to yield an inverted product. 
On the basis of the accumulated biochemical and structural data, we 
propose a general mechanism for the reaction (Fig. 2d). 

The most unusual feature of OGT is the intervening domain between 
the catalytic lobes, which is only found in metazoans (Supplementary 
Figs 6 and 7). This polypeptide adopts a topologically novel fold with a 
seven-stranded f-sheet core stabilized by flanking «-helices (Fig. 3a). 
There are two long unstructured loops for which electron density is 
missing. An electrostatic surface rendering shows that the intervening 
domain and an adjacent helix of the C-Cat domain form a large basic 
surface comprising ten lysine residues (Fig. 3a and b). Among these are 
K981 and K982, which were previously reported to constitute part of a 
PIP; binding motif that recruits OGT to membranes’. We mutated 
eight of these ten lysines in various combinations (Supplementary 
Table 3). All mutants were catalytically active (Supplementary Fig. 8), 
but we were unable to identify a role for the Int-D domain in PIP; 
binding (Supplementary Table 5). We suggest that this domain is 
involved in other functions in vivo. These functions may include sub- 
strate selection, cellular localization, or interactions with regulatory 
factors or receptors. The reported structures and mutant data provide 
a crucial starting point for investigating the possible roles of the inter- 
vening domain. 

The structures reported here show how OGT recognizes peptide 
sequences and provide new information on the enzymatic mechanism 
as well as a view of the intervening domain. Models of full-length 
human OGT in its open and closed states, constructed on the basis 
of crystal structures and molecular dynamics simulations, highlight 
the conformational changes that may regulate access of substrates to 
the active site (Fig. 3c and d). Our structures may assist in the develop- 
ment of inhibitors with possible therapeutic value for treating diseases 
associated with excessive O-GlcNAcylation. 


METHODS SUMMARY 


Human OGT residues 313-1031 (CPTH...KPVE) were expressed in Escherichia 
coli and purified by nickel affinity chromatography and gel filtration. Protein was 
then incubated with UDP or with UDP and a 17-residue substrate peptide 
(KKKYPGGSTPVSSANMM), which was cleaved to YPGGSTPVSSANMM in 
the crystallization drop (confirmed by mass spectrometry). The OGT-UDP struc- 
ture was determined using the method of multiple isomorphous replacement with 
anomalous scattering (MIRAS) (Supplementary Table 2). The OGT-UDP-pep- 
tide complex structure was solved by molecular replacement using the refined 
OGT-UDP structure. The crystal packing for the two complexes is described in 
Supplementary Fig. 9. Kinetic analysis was performed using UDP-'*C-GlcNAc 
and a lysine tagged CKII peptide using our previously described filter binding 
assay~’. The molecular dynamics simulation was performed by using the program 
Desmond” on an optimized 64-node Linux-based InfiniBand cluster. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Protein purification. Full length human OGT (ncOGT) was expressed as previ- 
ously described. The OGT,5 construct (spanning residues 313-1031 based on the 
numbering of the full length human protein) was constructed from our previously 
reported E. coli codon-optimized construct using primers listed in Supplementary 
Table 6 after being cloned into a pET24b vector (Novagen)”. After plasmid 
transformation into BL21 (DE3), the protein was expressed as a fusion protein 
with an N terminus consisting of a T7 tag, followed by an 8-His tag, followed by an 
HRV3C protease cleavage site (LEVLFQGP). Cultures were grown at 37 °C after 
diluting an overnight culture 1 to 100 in fresh LB media. Cells were grown to an 
Agoo Of 1.1, at which point they were transferred to a temperature of 16 °C. After 
letting the cells grow at 16 °C for 30 min, they were induced with 0.2 mM IPTG and 
grown overnight at 16 °C for 16 h. Cells were pelleted, resuspended in TBS (20 mM 
Tris, pH 7.4, 250mM NaCl) supplemented with 1mM PMSF and 0.1 mgml"* 
lysozyme, lysed and the lysate was centrifuged at 5,000g for 20 min to remove 
unbroken cells. The supernatant was then centrifuged at 100,000¢ to further clean 
the lysate. Imidazole was then added to the supernatant to a final concentration of 
40 mM before the lysate was incubated with Ni-NTA agarose superflow resin 
(Qiagen) which was prewashed with TBS + 40mM imidazole for batch nickel 
affinity purification. After incubating the lysate and the resin with gentle rocking at 
4°C, the flowthrough was removed and the resin was washed with 10 column 
volumes of TBS + 50 mM imidazole. The protein was then eluted with 4 column 
volumes of TBS + 250 mM imidazole. The eluate was supplemented with 0.5 1M 
THP to prevent aggregation and then concentrated with centrifugal concentrators 
(Millipore). After protein concentration determination, the N-terminal tags were 
cleaved by adding HRV 3C protease (EMD) to the concentrated purified protein at 
a ratio 1 unit/150 1g of protein and incubating at 4 °C for 16 h. Following cleavage, 
the protein was further purified by gel filtration on a Superdex 200 column (GE 
Healthcare) in TBS (20 mM Tris, pH 8.0, 150 mM NaCl) + 0.5 uM THP (EMD). 
The fractions were collected and concentrated using centrifugal concentrators 
again. The hOGT,5 protein was monomeric in solution, as determined by gel 
filtration and sedimentation equilibrium analytical ultracentrifugation. The protein 
was then diluted 1:1 in water before setting up crystals. 

Native crystals. All crystals were grown with the hanging drop method at room 
temperature. For the UDP structure, 7 mg ml‘ protein was incubated with 1 mM 
UDP for several hours at 4°C. After screening, optimal crystals were obtained 
when 10 ul of protein was mixed with 5 ul of reservoir solution containing 1.45 M 
potassium phosphate dibasic, 8mM EDTA, and 1% xylitol. After several days, 
hexagonal rod crystals grew, to a maximum size of about 400 X 100 X 100 um. 
Crystals were flash frozen using a cryoprotectant consisting of 1.8 M potassium 
phosphate and 27% xylitol. For the peptide-UDP complex, OGT was incubated 
with 1 mM UDP and 2 mM CKII3K peptide**”’ for several hours at 4 °C. Crystals 
were obtained by mixing 8 tl protein solution with 4 ul reservoir containing 1.6 M 
Li,SO, and 0.1M bis-tris propane-HCl pH 7.0 (1,3-bis(tris(hydroxymethyl) 
methylamino)propane). Trapezoidal crystals appeared after several days. 
Crystals were frozen in a cryprotectant consisting of 1.72 M Li,SOy, 0.05 M Bis 
Tris Propane, pH 7.0 and 28% xylitol. 

Heavy metal soaks. Several heavy metal compounds were screened using the 
method of ref. 31. After identifying several promising heavy metal compounds, 
the following conditions gave useful derivatives: K,PtCl,, 10 mM, 1 h soak; sodium 
aurothiomalate, 10 mM, 15min soak; K,PtCl4, 10 mM, 10min soak; KjPtBry, 
1 mM, 1h soak. 

Data collection. All the data were collected at NSLS X29 or X25 at Brookhaven 
National Laboratory except for the gold derivative, which was collected at ID24C 
at APS at Argonne National Laboratory. The heavy metal derivatives were col- 
lected at the following peak wavelengths: gold at 1.0384A and platinum at 
1.0715 A. The UDP structure and all the derivatives belonged to the space group 
P321. The peptide complex crystals were [2. All data sets were processed with 
iMosflm” and scaled using SCALA”. 
Structure determination and refinement of the OGT-UDP structure. The 
structure of the native OGT-UDP complex was determined by using MIRAS with 
the program SHARP™. The native data set and all the heavy atom derivative data 
sets were processed with iMosflm and Scala. Heavy atom sites in the K,PtCl, 1h 
soak data set were first determined by using HKL2MAP*. SAD phases were then 
obtained with the CCP4 program Phaser*® (Experimental Phasing). These initial 
phases were then used to find the heavy atom sites in the other data sets using the 
CCP4*’ program FFT. After obtaining all the sites, multiple isomorphous replace- 
ment with anomalous scattering (MIRAS) phases to 4.4 A were obtained using 
SHARP. The figures of merit at this resolution were 0.46329 (acentric) and 0.47049 
(centric). After MIRAS phasing, the map was interpretable, and we confirmed that 
there were four monomers in the asymmetric unit. Density modification and 
phase extension to 2.78 A with NCS averaging were performed using DM, yielding 
a map with clear side chains. A model was built using as a guide both the structure 
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of the bacterial homologue (using a homology model generated with Swiss Model) 
and the heavy atom locations. There are two loops in the intervening domain for 
which there is no electron density, so these residues are omitted from the model. 
Twelve residues are missing from one loop and four from the other. The structure 
was refined with CNS*. Initial rigid body refinement optimized the placement of 
the monomers and then the components of each monomer. After several iterative 
rounds of simulated annealing, individual B factor refinement, and manual adjust- 
ments using COOT”, the UDP and waters were added, and the structure refined to 
an Ryork Of 21% and an Réee of 24%. Refinement was completed in Phenix” using 
TLS refinement’, minimization, and individual B factor refinement to give a 
final Ryork Of 18.5% and Réree of 21.8%. Figures were prepared using Pymol* and 
CCP4mg™. 

Structure determination and refinement of the OGT-UDP-peptide complex. 
Data were processed with iMosflm and Scala, and the structure was determined by 
molecular replacement. The refined OGT-UDP structure described above was 
used as a search model using the Phaser molecular replacement module* in CCP4. 
Initial molecular replacement efforts showed that whereas the catalytic domain 
was nearly identical in the UDP and UDP-peptide cocomplexes, the orientation of 
the TPRs relative to the catalytic domain was noticeably different. Therefore, the 
model was broken into three parts: the catalytic domain and two sections of the 
TPR domains. Using this approach, a good map and model were obtained, which 
confirmed the twofold NCS present in this structure. The peptide was built by 
hand, as the side chains were already clear enough at this point to place the residues 
properly. The peptide in the crystal structure was cleaved from KKKYPGGSTPVS 
SANMM to YPGGSTPVSSANMM, as confirmed by mass spectrometry. The 
model was then refined with Phenix. As before, repeated rounds of annealing 
and individual B factor refinement were interspersed with manual adjustments 
in COOT. Waters were then added and sulphate ions were added after refining the 
waters. The structure was completed with cycles of annealing, minimization, TLS 
and B factor refinement, leading to a final structure with Rwork of 22.4% and Rgrec of 
25.2%. The crystal packing for the two complexes is described in Supplementary 
Fig. 9. 

Kinetics. Mutants were made from the full-length ncOGT using QuickChange 
mutagenesis and the primers shown in Supplementary Table 6. Kinetic measurements 
were performed using a previously described filter binding assay”. Briefly, reaction 
mixtures containing 500 1M CKII3K peptide (KKKYPGGSTPVSSANMM), 6 1M 
UDP-“C-GlcNAc (300mCimmol' specific activity, American Radiochemicals), 
100nM OGT (WT or mutant protein), and buffer (125mM NaCl, 1mM EDTA, 
20 mM potassium phosphate, pH 7.4, and 500 uM tris(hydroxypropyl)phosphine) 
were incubated at room temperature for 30 min. Reactions were then quenched by 
spotting onto the Whatman P81 phosphocellulose disks, washed three times for five 
minutes in 0.5% phosphoric acid, and counted by liquid scintillation counting. 
Reactions proceeded to <10% conversion under these conditions. Positive and nega- 
tive controls were conducted similarly without enzyme, and positive controls were 
detected by liquid scintillation counting without the phosphoric acid wash step. Data 
were analysed based on triplicate experiments. For product inhibition experiments, 
substrate concentrations were used as described in Supplementary Fig. 2. Reactions 
were allowed to proceed for either 30 min or 60 min and performed in triplicate and 
analysed with linear regression using GraphPad Prism5. 

Model preparation. The hOGT,; construct contains the residues 313-1031 
(CPTH...KPVE) of the full-length ncOGT protein. Because the first two TPR 
units of hOGT, ; overlap with the last two TPR units of the previously crystallized 
human TPR domain (PDB code 1W3B)”, we superimposed each of the hOGT,.5 
structures (PDB codes 3PE3 and 3PE4) with the TPR domain to create composite 
models of full length human OGT. Coordinates are provided in Supplementary 
Data Files 1 and 2. 

Molecular dynamics. The coordinates of the OGT-UDP-peptide complex were 
optimized in the Protein Preparation Wizard (Schrodinger 2009) where hydro- 
gens were added; water molecules, UDP and peptide were stripped; and the struc- 
ture was minimized using the OPLS2001 forcefield. The 1-1m simulation used the 
CHARM27 forcefield*’, and the simple point charge model for water’. The 
CHARM27 forcefield was applied to the system using the VIPARR utility. The 
default Desmond relaxation was performed before simulation, and molecular 
dynamics were run at constant temperature (300 K) and pressure (1 bar). The 
simulation was performed by using the program Desmond, version 2.2.9.1.0° 
compiled by SBGrid on an optimized 64-node Linux-based InfiniBand cluster 
and took 75 days to complete. Molecular dynamics trajectories were processed 
and animated with VMD*. 

Lipid (PIP) binding assays. Recombinant OGT constructs (His- or GST-tagged 
full-length human OGT) were overexpressed in E. coli and purified by affinity chroma- 
tography, using agarose beads conjugated to nickel or glutathione, respectively. PIP 
binding assays were performed using PIP Strips (Echelon Biosciences). Each mem- 
brane was pre-incubated for 2h at room temperature with a blocking solution 
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containing 0.1% ovalbumin 


for GST fusion constructs) or 3% fatty acid free BSA 


(for His-tagged constructs) in buffer TBST (20 mM Tris pH 8.0, 50 mM NaCl, 0.1% 
Tween 20). Purified OGT proteins resuspended in TBST at various concentrations 
(0.2-2 1M) were applied to each membrane. Washing and developing steps were 
performed as outlined in the manufacturer’s protocols, using the same TBST described 
above, and protein was detected using either anti-His or anti-GST antibodies and HRP- 
conjugated secondary antibodies. 


31. 


32. 


33. 


34. 


35. 


36. 


37. 


Boggon, T. J. & Shapiro, L. Screening for phasing atoms in protein crystallography. 
Structure 8, R143-R149 (2000). 

Leslie, A. G. W. Recent changes to the MOSFLM package for processing film and 
image plate data. Joint CCP4 + ESF-EAMCB News. Protein Crystallogr. 26, 27-33 
(1992). 

Evans, P. Scalingandassessmentof data quality. Acta Crystallogr. D 62, 72-82 (2006). 
de la Fortelle, E. & Bricogne, G. Maximum-likelihood heavy-atom parameter 
refinement for the multiple isomorphous replacement and multiwavelength 
anomalous diffraction methods. Methods Enzymol. 276, 472-494 (1997). 

Pape, T. & Schneider, T. R. HKL2MAP: a graphical user interface for phasing with 
SHELX programs. J. Appl. Crystallogr. 37, 843-844 (2004). 

McCoy, A. J. etal. Phaser crystallographic software. J. Appl. Crystallogr. 40, 658-674 
(2007). 

Collaborative Computational Project, Number 4. The CCP4 suite: programs for 
protein crystallography. Acta Crystallogr. D 50, 760-763 (1994). 


38. 


39. 


40. 


41. 


42. 


43. 


44. 


45. 


46. 


47. 


48. 


Briinger, A. T. et al. Crystallography & NMR system: A new software suite for 
macromolecular structure determination. Acta Crystallogr. D 54, 905-921 (1998). 
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta 
Crystallogr. D 60, 2126-2132 (2004). 

Adams, P. D. et a/. PHENIX: a comprehensive Python-based system for 
macromolecular structure solution. Acta Crystallogr. D 66, 213-221 (2010). 
Painter, J. & Merritt, E.A. TLSMD web server for the generation of multi-group TLS 
models. J. Appl. Crystallogr. 39, 109-111 (2006). 

Painter, J. & Merritt, E. A. Optimal description of a protein structure in terms of 
multiple groups undergoing TLS motion. Acta Crystallogr. D 62, 439-450 (2006). 
DeLano, W. L. The Pymol Molecular Graphics System. (Delano Scientific, San 
Carlos, CA, 2002). 

Potterton, L. et a/. Developments in the CCP4 molecular-graphics project. Acta 
Crystallogr. D 60, 2288-2294 (2004). 

cCoy, A. J. Solving structures of protein complexes by molecular replacement 
with Phaser. Acta Crystallogr. D 63, 32-41 (2007). 

ackerell, A. D. Jr, Feig, M. & Brooks, C. L. Ill. Extending the treatment of backbone 
energetics in protein force fields: limitations of gas-phase quantum mechanics in 
reproducing protein conformational distributions in molecular dynamics 
simulations. J. Comput. Chem. 25, 1400-1415 (2004). 

Berendsen, H. J. C., Postma, J. P. M., van Gunsteren, W. F. & Hermans, J. in 
Intermolecular Forces (ed. Pullman, B.) 331-342 (Reidel, 1981). 

Humphrey, W., Dalke, A. & Schulten, K. VMD: visual molecular dynamics. J. Mol. 
Graph. 14, 27-28, 33-38 (1996). 


©2011 Macmillan Publishers Limited. All rights reserved 


LR 


doi:10.1038/nature09639 


Exome sequencing identifies frequent mutation of the 
SWI/SNF complex gene PBRM1 in renal carcinoma 


Ignacio Varela’, Patrick Tarpey’, Keiran Raine’, Dachuan Huang”, Choon Kiat Ong’, Philip Stephens’, Helen Davies', David Jones', 
Meng-Lay Lin’, Jon Teague’, Graham Bignell’, Adam Butler’, Juok Cho’, Gillian L. Dalgliesh’, Danushka Galappaththige', 

Chris Greenman, Claire Hardy!, Mingming Jia’, Calli Latimer', King Wai Lau!, John Marshall!, Stuart McLaren', Andrew Menzies}, 
Laura Mudie’, Lucy Stebbings', David A. Largaespada’, L. F. A.Wessels*, Stephane Richard®®, Richard J. Kahnoski’, John Anema’, 
David A.Tuveson®, Pedro A. Perez-Mancera®, Ville Mustonen’, Andrej Fischer?!°, David J. Adams", Alistair Rust", 

Waraporn Chan-on?, Chutima Subimerb’, Karl Dykema’, Kyle Furge’’, Peter J. Campbell’, Bin Tean Teh?!*)!4, 


Michael R. Stratton’! & P. Andrew Futreal! 


The genetics of renal cancer is dominated by inactivation of the VHL 
tumour suppressor gene in clear cell carcinoma (ccRCC), the com- 
monest histological subtype. A recent large-scale screen of ~3,500 
genes by PCR-based exon re-sequencing identified several new cancer 
genes in cCRCC including UTX (also known as KDMG6A)', JARIDIC 
(also known as KDM5C) and SETD2 (ref. 2). These genes encode 
enzymes that demethylate (UTX, JARID1C) or methylate (SETD2) 
key lysine residues of histone H3. Modification of the methylation 
state of these lysine residues of histone H3 regulates chromatin struc- 
ture and is implicated in transcriptional control’. However, together 
these mutations are present in fewer than 15% of ccRCC, suggesting 
the existence of additional, currently unidentified cancer genes. Here, 
we have sequenced the protein coding exome in a series of primary 
ccRCC and report the identification of the SWI/SNF chromatin 
remodelling complex gene PBRM1 (ref. 4) as a second major 
ccRCC cancer gene, with truncating mutations in 41% (92/227) of 
cases. These data further elucidate the somatic genetic architecture of 
ccRCC and emphasize the marked contribution of aberrant chro- 
matin biology. 

Exome sequencing based on a solution phase capture approach® was 
performed on seven cases of ccRCC, three of which carry VHL muta- 
tions, and matching normal DNAs (See Supplementary Information 
and Supplementary Table 1). Captured material was sequenced using 
76 base pair paired-end reads on the Illumina GAIIx platform. After 
read alignment, variant calling was performed using a naive Bayesian 
classifier algorithm for substitutions and a split-read mapping 
approach (PinDel® with substantial cancer-aware output filtering) for 
insertion/deletions (See Supplementary Material for details). These 
algorithms aim to identify somatically acquired coding and splice-site 
variants (that is, present in the tumour but not in the matching normal), 
and all mutations reported here were confirmed by PCR-based capillary 
sequencing. In total 156 somatic mutations were identified, of which 92 
were missense, 9 nonsense, 1 canonical splice site, 1 stop codon read- 
through, 11 frameshift and 42 synonymous (Supplementary Table 2). 

In four cases truncating mutations were identified in PBRM1. 
PBRM1 maps to chromosome 3p21 and encodes the BAF180 protein, 
the chromatin targeting subunit of the PBAF SWI/SNF chromatin 
remodelling complex’. The gene is comprised of six bromodomains 
involved in binding acetylated lysine residues on histone tails, two 
bromo-adjacent homology domains important in protein-protein 


interaction and an HMG DNA-binding domain*. PBAF complex- 
mediated chromatin remodelling is implicated in replication, 
transcription, DNA repair and control of cell proliferation/differenti- 
ation*’. The SMARCB1 and BRG1 components of this complex have 
inactivating mutations in rhabdoid tumours*” and BRG1 mutations 
have been reported in several tumour types’®. The PBRM1 mutations 
included three frame-shifting insertions and a nonsense mutation; all 
judged to be homozygous from SNP array and mutant allele read count 
data. PBRM1 was not included in our previous PCR-based sequencing 
screen? and was the only gene, apart from VHL, with recurrent trun- 
cating mutations in the seven cases screened. We next sequenced 
PBRM1 in a further 257 RCC cases, including 36 cases of papillary, 
chromophobe and other non-ccRCC cancers. Truncating mutations 
were identified in a remarkable 88/257 (34%) (Fig. 1) of cases, all 
diagnosed as ccRCC (for full data see Supplementary Tables 3 and 
4). PBRM1 mutations were all found in the context of chromosome 
3p loss of heterozygosity (38/38) where SNP array data was available 
(http://www.sanger.ac.uk/cgi-bin/genetics/CGP/cghviewer/CghHome.cgi). 
Two in-frame deletion mutations were identified—a predicted 
6-amino-acid deletion (p.M1209_E1214delMFYKKE) in the second 
BAH (bromo-adjacent homology) domain likely to be involved in 
protein-protein interactions within the SWI/SNF complex* and dele- 
tion of an isoleucine codon (Ile 57) in the first bromodomain (Fig. 1). 
Both deletions remove amino acids conserved to Caenorhabditis 
elegans and both were in cases with 3p LOH. The ratio of nine missense 
to zero silent mutations suggests that a proportion of the missense 
mutations are likely to be pathogenic. Six of nine missense mutations 
occur in bromodomains and one in the second BAH domain (Fig. 1). 
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Figure 1 | PBRM1 somatic mutations. Representation of PBRM1 transcript 
with boxes BD1-BD6, BAH1, BAH2 and HMG indicating the positions of the 
bromodomains 1-6, bromo-adjacent homology domains and high-mobility 
group domain, respectively. Relative positions of mutations are indicated by 
symbols. Stars, nonsense; dots, missense; red triangles, frameshift deletions; 
black triangles, frameshift insertions; and green triangles, in-frame deletions. 
Splice-site mutations are not depicted. 
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Figure 2 | Analysis of PBRM1 missense mutations. Bars represent histogram 
of the mean score of in silico generated random missense mutations (10,000 sets 
of three mutations that can be scored) and the red disk denotes the mean score 
of the somatic mutations that could be scored (T232P As = —7.78, A597D 
As = —9.69, H1204P As = —2.76). The somatic set is significantly different 
from the null set (P-value = 0.01). They have a higher negative mean score and 
are thus predicted to be more deleterious on average. 


The bromodomains of PBRM1 have been shown to have preferential 
binding to different acetylated lysine configurations of histone tails, 
indicating they may contribute to ‘reading’ of the histone code’. The 
likelihood of the missense mutations having functional impact was 
assessed using a scoring system calibrated with protein domain align- 
ments from Pfam (see Supplementary Methods). Three missense 
mutations (p.T232P, p.A597D and p.H1204P) could be scored with 
these alignments. This set of mutations was predicted to be deleterious, 
having a significantly lower mean score than a typical null set of in silico 
generated random missense mutations falling onto the scorable parts 
of the gene (P-value = 0.01 Fig. 2), making these mutations interesting 
candidates for functional studies. 
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Four PBRM1 truncating mutations have been described in breast 
cancer previously’’. Although there is frequent 3p21 LOH in small-cell 
lung cancer, no evidence for PBRM1 inactivation was found'*. To 
further evaluate the contribution of PBRM1 mutation in human cancer, 
copy number was evaluated and the coding exons were sequenced 
through a series of 727 cancer cell lines of various histologies. SNP array 
copy number analysis (http://www.sanger.ac.uk/cgi-bin/genetics/ 
CGP/cghviewer/CghHome.cgi) identified one homozygous deletion 
in the HCC-1143 breast cancer cell line, described previously’. 
Sequencing analysis identified five homozygous truncating mutations 
(Supplementary Table 5). Frame-shifting deletions were identified in 
the VHL-mutant A704 renal cancer, NCI-H2196 small-cell lung cancer 
and TGBC24TKB gall bladder cancer lines. Nonsense mutations were 
identified in the NCI-H226 squamous-cell lung cancer and PANC-10-05 
pancreatic adenocarcinoma lines. Interestingly, a PBRM1 truncating 
mutation has been reported in a comprehensive pancreatic cancer 
mutational screen’. 

To obtain further support that PBRM1 can act as a cancer gene, we 
examined data from several insertional mutagenesis screens in mice. 
Analyses of transposon insertion sites from a forward genetic screen 
performed using a conditional Sleeping Beauty transposon system’? in 
a mouse pancreatic cancer model"® revealed a significant enrichment 
of insertion events in Pbrm1 amongst all genes hit using Monte Carlo 
simulation analyses as described previously’’. Insertions were found in 
pancreatic dysplasia, intraductal (panIN) and high grade invasive 
tumours, indicating Pbrm1 inactivation is an early event in this model. 
The mixed forward and reverse pattern of insertions is indicative of 
inactivation, as demonstrated by RT-PCR showing premature ter- 
mination of the Pbrm1 cDNA via splicing into the inserted transposon 
(Fig. 3). These data suggest that loss of Pbrm1 cooperates with Kras in 
driving pancreatic tumour development in this model. Intriguingly, 
Setd2, previously implicated human ccRCC, was also found to rank 
significantly in frequency among all insertion sites and two tumours 
had both Setd2 and Pbrm1 insertions. These comparative oncoge- 
nomic data provide independent support for PBRM1 as a cancer gene 
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Figure 3 | Pbrm1 is frequently mutated in a mouse model of pancreatic 
cancer. To identify genes that co-operate with Kras in the formation of 
pancreatic cancer a conditional allele of Kras©/?? and Pdx1-Cre were combined 
with a conditional Sleeping Beauty transposase driver and the T2Onc® 
transposon donor allele”. Expression of Cre results in expression of Kras 
and transposon mobilization within the epithelial compartment of the 
pancreas. Isolation of the transposon insertion sites from a panel of 153 
pancreatic cancers and pre-neoplastic lesions generated from this model 
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revealed a common insertion site in Pbrm1 suggesting that loss of Pbrm1 co- 
operates with Kras©/?” in pancreatic cancer development. Statistical analysis 
was performed as described previously*’. Transposon insertions in the forward 
strand of Pbrm1 are shown in green. Insertions in the reverse orientation are 
shown in red. A chromatogram from sequencing of RT-PCR products from 
one tumour is shown demonstrating splicing of exon 24 of Pbrm1 into the 


inserted transposon, thus truncating the transcript. 
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and suggest further investigation of the role of PBRM1 (and SETD2) in 
human pancreatic cancer is warranted. 

Abrogation of PBRM1 expression via small interfering RNA 
(siRNA) knockdown in ccRCC cell lines was investigated to assess 
possible consequences of PBRM1 loss. Greater than 60% knockdown 
of PBRM1 RNA and protein resulted in a significant increase in pro- 
liferation 4/5 RCC lines (Fig. 4a, b and Supplementary Information). 
No effect was seen, however, in A704 which carries a homozygous 
truncating PBRM1 mutation and expresses no PBRM1, confirming 
the specificity of the assay. Further, knockdown of PBRM1 resulted 
in significantly increased colony formation in soft-agar and increased 
cell migration (Fig. 4c, d), indicative of an increase in transformed 
phenotype. Taken together, these data support PBRMI1 having a 
tumour suppressor role in ccRCC. 

Transcriptional profiling before and after PBRM1 knockdown was 
performed using gene expression microarrays. Gene set enrichment 
analysis following PBRM1 knockdown showed that PBRM1 activity 
regulates pathways associated with chromosomal instability and cel- 
lular proliferation (Fig. 4e and Supplementary Table 6), the latter being 
consistent with previous studies identifying PBRM1 as critical tran- 
scriptional regulator of p21 (also known as CDKN1A) in breast cancer 
cell lines'* and work showing that PBRM1 is implicated in regulating 
TP53-mediated replicative senescence’®. The PBAF complex has been 
shown to localize at kinetochores during mitosis’? and SMARCB1 has 
been implicated in spindle checkpoint control”, which would support 
the loss of PBRM1 giving rise to a chromosomal instability/spindle 
checkpoint expression phenotype. It may be of interest to explore 
further spindle checkpoint control in PBRM1-mutated ccRCC as a 
potential therapeutic opportunity. 
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Previous work has demonstrated that VHL loss alone is insufficient 
for ccRCC tumorigenesis, arguing the need for additional genetic 
events’? (B. T. Teh, unpublished) and has further suggested the 
existence of a 3p21 ‘gatekeeper’ ccRCC mutation on the basis of 
LOH studies”. The data presented here strongly suggest that inactiva- 
tion of PBRMI1 comprises this second major mutation in ccRCC 
development. Nearly all (36/38) PBRM1 mutant cases fall into the 
hypoxia signature group as described previously’, including 13/14 
cases without demonstrable VHL point mutations where expression 
data are available—further indicating the importance of PBRM1 in 
typical ccRCC development. The SWI/SNF complex has been impli- 
cated in the normal cellular response to hypoxia, with impairment of 
the complex rendering cells resistant to hypoxia-induced cell cycle 
arrest’, which would be consistent with selection for frequent loss 
of PBRM1 in ccRCC. Multiple cancers have apparently concomitant 
VHL, PBRM1 and SETD2 mutations, with all three genes mapping to 
chromosome 3p, indicating that the mutations are non-redundant 
functionally. Half (55/107) of cases in this series with a demonstrable 
VHL mutation’ have a PBRM1 mutation. Strikingly, all nine cases with 
a SETD2 mutation have a mutation in either PBRM1 or VHL, with 6 of 
9 cases having mutations in all three genes. Physical linkage of these 
three ccRCC cancer genes together with their potential interaction 
may be the key driver for the large scale 3p LOH seen in most cases 
of ccRCC—being particularly parsimonious in requiring only four 
genetic events to unmask three tumour suppressor genes as opposed 
to six if the genes were on different chromosomes. 

Several other mutated genes of potential interest were identified. In 
particular, ARIDIA encoding the BAF250A subunit of the SWI/SNF 
complex was found to have two heterozygous missense mutations: 
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Figure 4 | Knockdown of PBRM1 expression in RCC cell lines. 

a, Verification of PBRM1 knockdown by western blotting. b, Silencing PBRM1 
increased the proliferation of ACHN and 786-O with wild-type PBRM1, but 
not A704 with a homozygous PBRM1 truncating mutation. Data represent 
means of triplicate experiments with standard deviation, P< 0.01. 

c, Knockdown of PBRM1 enhanced colony formation in SN12C cells. Data 
represent means of triplicate experiments with standard deviation, P< 0.01. 
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Scale bar, 100 uM. d, Knockdown of PBRM1 enhanced cell migration in 786-O, 
SN12C and TK10 cells. Data represent means of triplicate experiments with 
standard deviation, P < 0.01. e, Gene sets that are most significantly 
deregulated following PBRM1 knockdown in three RCC cell lines using curated 
gene sets obtained from MSigDB (http://www. broadinstitute.org/gsea/msigdb/) 
and additional curated gene sets obtained from the PGSEA package (see 
Supplementary Information for details). 
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p-R1020K,c.3059G>A and p.L1872P,c.5615T>C. Both cases (PD2126, 
PD2127) have a PBRM1 truncating mutation. Two homozygous 
ARIDIA deletions were found in SNP 6.0 data (http://www.sanger.ac. 
uk/cgi-bin/genetics/CGP/conan/search.cgi) in the LB1047-RCC ccRCC 
and NCI-SNU-5 gastric carcinoma cell lines and loss of ARID1A expres- 
sion has been reported in RCC”. Frequent truncating ARID1A muta- 
tions have recently been reported in clear cell ovarian carcinoma’”’. 
These data all point to ARIDIA being a cancer gene, likely to be operative 
in cCRCC. PD2127 was also found to have a heterozygous truncating 
mutation in ARIDS5B, related to ARIDIA and recently implicated in 
childhood acute lymphoblastic leukaemia susceptibility”®. The extent to 
which the other mutated genes identified here contribute to cCRCC will 
await large-scale follow-up screens. Similarly, exome and whole genome 
sequencing on a large number of cases is likely to yield further insights. 

The identification of a second major cancer gene in ccRCC further 
defines the genetic and molecular architecture of this tumour type. It is 
remarkable that PBRM1, like the majority of the other non-VHL 
mutated cancer genes identified in ccRCC, is involved in chromatin 
regulation—again at least in part at the level of histone H3 modifica- 
tion and recognition. Understanding the contribution of PBRM1 
mutation to clinical disease progression and outcome as well the 
potential for exploiting SWI/SNF complex abrogation therapeutically 
are important future areas of renal cancer research. 


METHODS SUMMARY 


DNA samples from ccRCC patients tumour and matching normal were all obtained 
under local IRB and LREC approvals for this study and processed as described 
previously’. DNA fragmentation, library preparation and solution phase hybrid 
capture were according to manufacturer instructions (Agilent Technologies) and 
modified from protocols published previously’. Capillary-based Sanger sequencing 
for confirmations and PBRM1 follow-up were done as described previously” with 
manual inspection of all sequencing traces. mRNA was extracted from snap-frozen 
mouse pancreatic lesions and subjected to RT-PCR using a nested PCR approach 
using primers of mouse Pbrm1 exon 23/24 and the Carp-B-Actin Splice acceptor 
sequence of the T2Onc transposon cassette. Resulting bands were gel-purified and 
subjected to capillary-based Sanger sequencing. PBRM1 or scrambled control 
siRNAs (Santa Cruz) were transfected into ccRCC cell lines using Lipofectamine 
2000 (Invitrogen) according to the manufacturer’s conditions. Real-time PCR and 
western blotting were all done using standard protocols essentially as described’. 
Expression analyses were carried out as described previously’. 
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Cutaneous malignant melanoma isa highly aggressive and frequently 
chemoresistant cancer, the incidence of which continues to rise. 
Epidemiological studies show that the major aetiological melanoma 
risk factor is ultraviolet (UV) solar radiation, with the highest risk 
associated with intermittent burning doses, especially during child- 
hood'”. We have experimentally validated these epidemiological 
findings using the hepatocyte growth factor/scatter factor transgenic 
mouse model, which develops lesions in stages highly reminiscent 
of human melanoma with respect to biological, genetic and aetiolo- 
gical criteria, but only when irradiated as neonatal pups with UVB, 
not UVA**. However, the mechanisms underlying UVB-initiated, 
neonatal-specific melanomagenesis remain largely unknown. Here 
we introduce a mouse model permitting fluorescence-aided melano- 
cyte imaging and isolation following in vivo UV irradiation. We use 
expression profiling to show that activated neonatal skin melanocytes 
isolated following a melanomagenic UVB dose bear a distinct, per- 
sistent interferon response signature, including genes associated with 
immunoevasion. UVB-induced melanocyte activation, characterized 
by aberrant growth and migration, was abolished by antibody- 
mediated systemic blockade of interferon-y (IFN-y), but not type-I 
interferons. IFN-y was produced by macrophages recruited to 
neonatal skin by UVB-induced ligands to the chemokine receptor 
Ccr2. Admixed recruited skin macrophages enhanced transplanted 
melanoma growth by inhibiting apoptosis; notably, IFN-y blockade 
abolished macrophage-enhanced melanoma growth and survival. 
IFN-y-producing macrophages were also identified in 70% of human 
melanomas examined. Our data reveal an unanticipated role for 
IFN-y in promoting melanocytic cell survival/immunoevasion, 
identifying a novel candidate therapeutic target for a subset of 
melanoma patients. 

Mechanisms associated with UV-mediated alterations to melano- 
cytes and their microenvironment have been inscrutable because they 
cannot be adequately studied in cultured cells. Moreover, melanocytes 
represent only ~1% of skin cells, and bear few specific cell-surface 
markers permitting efficient isolation. To enable detailed study of 
melanocyte biology in vivo, we generated a mouse model in which 
expression of the reverse tetracycline-activated transactivator rtTA2s- 
M2, characterized by minimal leakiness and background, was regulated 
by the melanocyte-specific dopachrome tautomerase (Det) gene pro- 
moter (Supplementary Fig. 1a). Dct-rtTA mice bred with transgenic 
mice bearing a histone H2B-GFP fusion construct controlled by the 
tetracycline response element (TRE) created Dct-rtTA/TRE-H2B- 
GFP bi-transgenic mice (hereafter iDct-GFP) (Supplementary Fig. 1b). 
iDct-GFP mice showed an inducible GFP profile from embryonic to 
adult stages consistent with known Dct expression patterns. GFP 
expression was observed in embryonic neural crest, retinal pigment 
epithelium and telencephalon, as expected (Fig. 1a and Supplementary 


Fig. 2). Neonatal and adult skin GFP™ cells were strictly localized to 
hair follicles, where most GFP* cells were in bulb regions, with smaller 
numbers in the outer root sheath and bulge regions, harbouring 
melanocyte precursors® (Fig. 1b). Co-localization of GFP and anti- 
Dect antibody by immunohistochemistry unequivocally identified 
GFP" cells as melanocytes (Fig. 1c). No background GFP expression 
was detectable without doxycycline. Full GFP induction was achieved 
within 12-18h of a single intraperitoneal injection of a non-toxic 
doxycycline dose in neonatal or adult mice (Supplementary Fig. 1c). 

Reasoning that new clues to the molecular mechanism(s) under- 
lying UV-induced melanomagenesis would be found within the geno- 
mic response of melanocytes to UV radiation, we used the iDct-GFP 
mouse to examine the responses to UVB versus UVA of melanocytes 
residing in situ, within their natural morphological and physiological 
microenvironment. Precisely defined wideband wavelengths and 
physiologically relevant doses* (see Methods) of UV (Supplementary 
Fig. 3) were used to irradiate postnatal day 1 (PD1) iDct-GFP mice, 
and skins were harvested at certain time points after irradiation. To 
avoid potential toxicity from chronic expression and interference with 
UV absorption, GFP was doxycycline-induced after UV irradiation, 
24h before skin harvest. GFP* melanocytes were phenotypically acti- 
vated, characterized by raised melanocyte numbers and migration 
towards the epidermis as previously reported®, 24-48 h after exposure 
to UVB, but not UVA, peaking at 3 days and lasting at least 10 days 
after irradiation (Fig. 1d and Supplementary Fig. 4a). UVB-induced 
melanocyte activation was specific to neonatal irradiation; adult mice 
irradiated at PD29 did not show this response (Supplementary Fig. 4b). 

We performed an expression microarray study on melanocytes iso- 
lated from dorsal skin of iDct-GFP pups irradiated at PD1 with UVB 
or UVA (Fig. le). Doxycycline-induced GFP-labelled melanocytes 
were isolated via fluorescence-activated cell sorting (FACS) at 1 day 
and 6 days after irradiation (ages PD2 and PD7, respectively); arrays 
from 1 day post-UV would reflect the acute UV stress response of in 
vivo melanocytes, whereas the 6 days post-UV time point should 
uncover responses persisting after the acute stress response subsides. 
FACS isolation consistently yielded >95% melanocyte enrichment 
(Supplementary Fig. 5). Gene expression profiling produced robust 
data with good reproducibility among biological triplicates (Fig. 2a), 
and confirmed the absence of detectable levels of contaminating skin 
cell types, including keratinocytes, fibroblasts and adipocytes (Sup- 
plementary Fig. 6). 

UVB elicited a potent, transient, acute stress response in melano- 
cytes, including increased expression of p53 target genes (for example, 
p21?! cyclin G1 and reprimo), whereas UVA-associated changes 
were subtle (Fig. 2a and Supplementary Fig. 7). Intriguingly, a small 
subset of genes showed a delayed response evident at 6 days after UVB 
exposure (Fig. 2a), including a putative IFN-responsive gene signature’ 
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Figure 1 | Melanocyte-specific GFP expression reveals UVB-induced 
activation. a, Embryonic day 11.5 iDct-GFP embryo showing GFP* cells in 
neural crest (red arrow) and telencephalon (white arrow); magnification, 10. 
b, In 7-day old pup skin GFP* cells are located in the bulb (lower arrow) and 
bulge (upper arrow) regions of hair follicles. Blue, 4’,6-diamidino-2- 
phenylindole (DAPI); magnification, x40. D, dermis; E, epidermis. 

c, Immunohistochemistry with anti-Dct antibody shows co-localization with 
GFP in iDct-GFP skin; magnification, 100. d, UVB-induced activation of 
melanocytes, characterized by proliferation and migration towards epidermis. 
Dorsal skins were examined at 1 day (at age PD2) and 6 days (PD7) after 
irradiation. Scale bars, 40 jum. e, Schematic of the regime for isolating GFPt 
melanocytes by FACS. Pups are irradiated at PD1, and dorsal skins harvested at 
either PD2 (24h post-UV) or PD7 (6 days post-UV). Doxycycline (Dox) 
injections are always given after irradiation, 24h before skin harvest. 


(Fig. 2a and Supplementary Table 1). Four upregulated genes (Ccl8, 
Ctla4, H2-K1 and H2-T23) from this group were validated by quanti- 
tative reverse transcription—polymerase chain reaction (qRT-PCR) 
(Fig. 2b). The response was neonate specific (Supplementary Fig. 8), 
and included genes implicated in conferring immunoevasiveness (that 
is, Ctla4, H2-T23, H2-M3, Cfb (also known as H2-Bf) and C4a (also 
known as Sip)). 

To determine whether IFN signalling has a biologically significant 
role in UVB-induced melanocyte activation, we blocked both type-I 
(IFN-« and IFN-f) and type-II (IFN-y) interferons by neonatal 
administration of anti-IFN-« receptor 1 (IFN-«R1) and anti-IFN-y 
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antibodies, respectively. Melanocytes from 6-day post-UVB skin were 
activated in the presence of isotype control antibody, whereas the anti- 
IFN-oR1+anti-IFN-y antibody combination completely abolished 
this response (Fig. 2c). Moreover, although antibody-mediated block- 
ade of type-I IFN-«/B signalling alone failed to overtly affect UVB- 
mediated activation, blockade of IFN-y alone markedly inhibited this 
response (Fig. 2c). These results were corroborated by flow cytometric 
quantification of GFP* skin cells from each antibody-treated group 
(Supplementary Fig. 9). We next isolated GFP’ melanocytes from 
UV-irradiated neonates blocked with either anti-IFN-y or anti-IFN- 
a#R1, and compared their expression patterns to control. Anti-IFN-y 
and anti-IFN-«R1 antibodies repressed expression of a common gene 
set associated with the IFN response (Supplementary Fig. 10). 
However, IFN-y inhibition also more potently repressed expression 
of several non-classical major histocompatibility complex (MHC) 
class Ib antigens (for example, H2-T23 and H2-Q2), as well as 
Psmb9, Gbp2, Icam, Irfl and Fosb; members of this gene subset should 
be responsible for the observed melanocytic phenotypes. Notably, 
IFN-y blockade exclusively and potently inhibited expression of che- 
mokine Ccl8 (also known as MCP-2). 

We determined the IFN-y source by interrogating 6-day post- UVB- 
irradiated skin for immune-cell infiltration. Immunohistochemistry 
failed to detect T cells, B cells, dendritic cells, natural killer (NK), or 
NK-T cells; however, CD11b* cells of myeloid origin were evident 
(Supplementary Fig. 11). Anti-F4/80 and anti-Gr-1 antibodies iden- 
tified these as macrophages (F4/80* Gr-1~ ), not neutrophils (Fig. 3a, 
upper panel). Nearly 90% of CD11b* cells were also F4/80* 
(Supplementary Fig. 12). Notably, the adult response was distinct; 
dorsal skins from PD35 mice 2 days after UVB showed predominant 
Gr-1” cell infiltration, and minimal F4/80* cells (Fig. 3a, lower panel), 
as reported®. 

To determine if infiltrating macrophages expressed IFN-y, as has 
been suggested”, type-I and type-II interferon mRNAs were quantified 
from FACS-purified CD11b* and F4/80* cells. Both had upregulated 
IFN-o and IFN-y expression, and to a lesser extent IFN-B (Fig. 3b). 
Flow cytometry demonstrated that 28% of both CD11b* and F4/80* 
cells expressed IFN-y (Fig. 3c). Our data support the notion that UVB- 
recruited, infiltrating macrophages secrete IFN-y, inducing the IFN 
signature detected in activated melanocytes. Although we were unable 
to detect NK cell markers in FACS-isolated CD11b* cells by qRT- 
PCR (Supplementary Fig. 13), we cannot rule out a possible contri- 
bution by undetectably low numbers of NK or other inflammatory 
cells. 

That UVB-induced chemokines were responsible for neonatal 
macrophage recruitment, particularly Ccr2/Ccr5 ligands, was indicated 
by our melanocyte microarray data and confirmed by qRT-PCR 
(Supplementary Fig. 14a, b). These results were further corroborated 
through treatment of cultured melan-c normal immortalized melano- 
cyte cell line with non-cytocidal UV (Supplementary Fig. 14c). In 
contrast, upregulation of these chemokines was not detected in 
UV-irradiated whole skin (Supplementary Fig. 15) or isolated kerati- 
nocytes'’. Ccr2 and Ccr5 were highly expressed in skin-infiltrating 
macrophages, but not in non-activated RAW264.7 macrophages 
(Supplementary Fig. 16). Finally, mice deficient in Ccr2 were signifi- 
cantly inhibited in their ability to recruit F4/80* macrophages into 
neonatal skin (Fig. 3d); in contrast, Ccr5-deficient mice showed no 
significant difference (data not shown). 

The arrival of IFN-y-expressing macrophages coincided with 
a >100-fold upregulation in melanocyte expression of the Ccr2 ligand 
Ccl8, a known IFN-y-response gene"’, whereas expression of other Ccr2 
ligands had returned to baseline. We propose that recruited IEN-y~ 
macrophages enhance melanocyte Ccl8 expression, reinforcing macro- 
phage-melanocyte interactions and fueling an inflammatory positive 
feedback loop. To confirm its ability to potently chemoattract macro- 
phages, Ccl8 was ectopically expressed in F5061 cells—established from 
a UV-induced melanoma from an immunocompetent hepatocyte 
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Figure 2 | UVB-induced melanocyte activation is mediated by IFN-y. 

a, Unsupervised clustering of complementary DNA (cDNA) microarray 
analysis of gene expression in FACS-sorted melanocytes from 1 day (PD2) or 6 
days (PD7) after UVB or UVA irradiation, and respective unirradiated 
controls. The expanded heatmap (right) shows the delayed induced gene 
subset, which includes multiple genes known to be induced by IFN-y. Primary 
mouse keratinocytes (PrKC) were included as controls. All groups included 


growth factor/scatter factor (HGEF/SF)-transgenic mouse. F5061-Ccl8 
cells were subcutaneously inoculated into syngeneic FVB/N mice, 
markedly enhancing macrophage infiltration into the transplantation 
site (Fig. 3e). Conditioned media from F5061-Ccl8 cells also signifi- 
cantly elevated transmembrane migration of RAW264.7 macrophages 
(Supplementary Fig. 17). 

Macrophages show either anti- or pro-tumorigenic properties’*. F4/ 
80° macrophages were isolated from 6-day post-UVB neonatal skin, 
admixed at a ratio of 1:5 with F5061 melanoma cells and transplanted 
subcutaneously into FVB/N mice. Admixed transplants showed sig- 
nificantly increased growth relative to controls (Fig. 4a and Sup- 
plementary Fig. 18), indicating that these activated macrophages were 
pro-tumorigenic. In contrast, macrophages isolated from spleens of 
unirradiated control pups did not affect tumour growth (Supplemen- 
tary Fig. 19). K,-67 immunohistochemistry showed no difference in 
proliferation between admixed tumours versus controls (Supplemen- 
tary Fig. 20); however, TdT-mediated dUTP nick end labelling (TUNEL) 
assays revealed significantly less apoptosis in admixed tumours (Fig. 4b 
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biological triplicates. b, RT-PCR validation of expression of 4 genes (n = 3 
samples each) from IFN signature (error bars = s.e.m.). ¢, Antibody-mediated 
blockade of interferons by treating pups with intraperitoneal injections of anti- 
IFN-oR1, anti-IFN-7, or both in combination, 1 h before and 3 days after UVB 
irradiation at PD1. The dorsal skins were harvested (n = 3 each group) and 
analysed for melanocyte activation. Representative images are shown. D, 
dermis; E, epidermis. Scale bars, 18 jum. 


and Supplementary Fig. 21), demonstrating that UVB-recruited macro- 
phages promote melanoma cell survival. Immunohistochemistry con- 
firmed that a subset of tumour-associated macrophages maintained 
IFN-y expression (Supplementary Fig. 22). Moreover, macrophage 
presence strongly correlated with enhanced Ctla4 expression in 
F5061 melanoma cells (Supplementary Fig. 23), recapitulating the func- 
tional consequence of macrophage infiltration observed in neonatal 
skin melanocytes (Fig. 2a, b). 

Despite its well-documented anti-tumorigenic activity’, IFN-y has 
also been implicated as a pro-tumorigenic factor'*”’. To determine if 
macrophage-secreted IFN-y was responsible for the enhanced melanoma 
growth observed in admixed tumours, we included intraperitoneal 
administration of either anti-IFN-y or control antibodies. Whereas 
admixed melanomas in the control group showed the expected enhanced 
growth, those in mice given anti-IFN-y antibody showed significantly 
reduced growth (Fig. 4c). Immunophenotyping of tissue microarrays 
containing UVB-induced mouse melanomas showed that most (66%) 
were macrophage-rich, with fewer having T cells (59%) and B cells (32%) 
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Figure 3 | UVB induces chemoattraction of IFN-y-producing macrophages 
into neonatal skin. a, Immunohistochemistry with anti-F4/80 and anti-Gr-1 
antibodies in dorsal skins of UVB-irradiated and unirradiated neonatal (upper 
panel) and adult (lower panel) mice. Scale bars, 40 tum. b, (RT-PCR for IFN-«, 
IEN-f and IFN-y expression in F4/80* and CD11b” cells FACS-isolated from 
neonatal dorsal skins UVB-irradiated at PD1 and examined at PD7, compared 
with non-activated RAW264.7 macrophages. c, Flow cytometric analysis of 

PD1 UVB- irradiated (PD7 examined) skin-cell-suspension-identified IFN-y~ 


(Supplementary Fig. 24). To determine if human melanoma-associated 
macrophages produce IFN-y, we performed dual immunohisto- 
chemistry for CD68 and IFN-y using human melanoma tissue micro- 
arrays. We discovered that 19 of 27 (70%) melanomas examined 
contained abundant macrophages (CD68*), of which all 19 demon- 
strated CD68* IFN-y* dual positivity (Fig. 4d and Supplementary 
Fig. 25). 

In this study we show that UV incites melanomagenesis not only 
through DNA mutagenesis, but also by altering interactions between 
melanocytes and their microenvironment to regulate remodelling of 
UV-damaged skin. On the basis of our results, a model implicating a 
neonatal-specific UV-induced pro-melanomagenic inflammatory 
cascade emerges (Fig. 4e). In accordance with their relative tumori- 
genicity in albino HGF/SF-transgenic mice*, UVB not UVA induces 
melanocytic expression of multiple chemoattracting Ccr2 ligands 
(Cel2, Cel7, Cel8, Ccl12), recruiting Ccr2* macrophages into neonatal 
skin. IFN-y from recruited macrophages stimulates melanocyte pro- 
liferation and migration, and the expression of genes implicated in 
immunoevasion/survival. 
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macrophages. PE, phycoerythrin. d, Flow cytometric analysis of macrophage 
(F4/80~) infiltration into skin, 2 days after UVB irradiation in Ccr2-deficient 
pups (irradiated at PD1), as compared to wild-type pups. **P < 0.01; one-way 
ANOVA test with post-hoc Tukey analysis. NS, not significant. e, F5061 
melanoma cells ectopically expressing Ccl8 chemoattract F4/80' macrophages 
(red), but not Gr-1* cells, to sites of subcutaneous inoculation in syngeneic 
FVB/N mice. One vector-transfected and two Ccl8-transfected clone cells were 
used. Blue, DAPI. 


Erythemal neonatal UVB causes robust macrophage infiltration and 
is melanomagenic; adult skin responds with a rapid, short-lived neutro- 
phil influx, but no melanoma. We propose that mechanisms underlying 
neonatal UVB-induced melanomagenesis operate within the immuno- 
editing paradigm’*: UVB-activated mutant neonatal melanocytes— 
particularly progenitors'°—exposed to inflammation evade immune- 
mediated elimination, persisting through an extended equilibrium 
phase before evolving into clinically significant melanoma. Macrophage- 
induced melanocyte proliferation would more efficiently fix UV- 
induced mutations in prospective melanoma cells, whereas enhanced 
melanocyte migration could facilitate UVB-associated long-term 
tolerance to melanocytic antigens’” by promoting aberrant melanocyte- 
immune-cell interactions. Moreover, enduring inflammation-associated 
epigenetic alterations occur in transformed cells’, and perhaps in long- 
lived macrophage subpopulations”, indefinitely extending their bio- 
logical effects. 

Notably, our systemic antibody blockade experiments demonstrat- 
ing the importance of physiologically relevant IFN-y in UVB-induced 
melanocyte activation and melanoma cell survival strongly support the 
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Figure 4 | IFN-y mediates pro-tumorigenic effects of UVB-recruited skin 
macrophages. a, Mean volumes (= s.e.m.) of F5061 melanomas admixed with 
skin macrophages isolated from PD1 UVB-irradiated (PD7-examined) pups 
(n = 10), versus F5061-only controls (n = 9), following subcutaneous 
transplantation in syngeneic FVB/N mice. **P < 0.01. b, Percentages of 
TUNEL plus cells (+ s.e.m.) in admixed and control tumours (n = 6 each). 
** P< 0,001. c, Antibody-mediated blockade of IFN-y significantly inhibits 
pro-tumorigenic effects of macrophages. Cntrl Ab, control antibody. *P < 0.05; 
**P < 0.01; NS, not significant. One-Way ANOVA test with post-hoc Tukey 
analysis. d, Dual immunohistochemistry with anti-IFN-y (red) and anti-CD68 


notion that IFN-y can be pro-tumorigenic as well as anti-tumorigenic, 
depending on the context, intensity and durability of the IFN-y sig- 
nal'*. In fact, an association between IFN-y-induced genes and early 
mouse melanocytic lesions has been reported’, and serum IFN-y has 
been implicated as an independent prognostic indicator for melanoma 
recurrence’. We propose that IFN-y-associated survival mechanisms 
operational in neonatal melanocytes are recapitulated in melanoma, 
contributing to the selection of more aggressive, therapeutically res- 
istant phenotypes. Relevance to human melanoma is supported by the 
detection of macrophage-associated IFN-y expression in most patient 
samples examined, and a clinical trial showing that IFN-y may have 
adverse effects regarding melanoma patient relapse and mortality”. 
We provide the first evidence, to our knowledge, that IFN-y-R signal- 
ling can facilitate melanoma progression, a remarkable discovery con- 
sidering that high-dose IFN-« is used to treat melanoma, albeit with 
limited success”. Non-overlapping functions of type-I and type-II 
interferons are well described’, and strongly supported by our data. 
The IFN ‘survival signature’ associated with UVB-activated mouse 
melanocytes contains genes involved in human melanoma immuno- 
evasion, including non-classical MHC class Ib antigens (mouse H2- 
M3/human HLA-G; mouse H2-T23 (also known as Qa-1)/human 
HLA-E)****. HLA-E suppresses NK and cytotoxic T lymphocytes””*. 


oy CCR2 


Macrophage \ 


(green) antibodies on a human melanoma tissue microarray shows IFN-y- 
expressing macrophages (yellow). Representative tumour is shown. 

e, Schematic representation of the UVB-induced inflammatory cascade leading 
to IFN-y-mediated immunoevasion and survival of melanocytes during 
sunburn-associated remodelling. UV induces release of CCR2 ligands (1); that 
activate CCR2* macrophages (2); which are recruited to neonatal skin (3); 
macrophages secrete IFN-y (4); which activates melanocytes, inducing 
expression of genes that include CCL8, fueling inflammation (5); and 
immunoevasion (6). 


This IFN signature also features complement isoforms C4a and C4b, 
implicated in systemic autoimmunity suppression’, and CTLA4, a 
potent immune evasion facilitator. CTLA4 is also highly upregulated 
in mouse melanoma cells admixed with neonatal macrophages, and 
expressed on human melanoma cells, where it may be involved in 
immune escape**”’. 

Here we identify novel cellular/molecular inflammatory mechanisms 
centred on IFN-y signalling that may underlie the initiation, survival 
and/or outgrowth of UVB-induced melanoma cells. We propose that 
such mechanisms are highly relevant to strategies used by melanoma 
cells to evade immunosurveillance in patients. In what could prove to 
be a paradigm shift, our data strongly suggest that IFN-y/IFN-y-R or 
its downstream pathway members represent promising prognostic 
markers and/or efficacious therapeutic targets in an appropriate subset 
of melanoma patients. 


METHODS SUMMARY 


All mouse studies were performed under the strict guidelines of Animal Study 
Protocols approved by the Animal Care and Use Committees at the National 
Cancer Institute/National Institutes of Health and the George Washington 
University Medical Center. The Dct-rtTA transgenic mice express rtTA2s-M2 
under the control of the Det (Trp2) gene promoter. Thirty-one founder transgenic 
mice were generated by standard microinjection techniques, individually crossed 
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with the TRE-H2B-GFP transgenic mice, and screened for a line that produced no 
leaky background expression of GFP without doxycycline and quickly responded 
to a single intraperitoneal doxycycline injection. One such line was selected to 
make a double homozygous Dct-rtTA/TRE-H2B-GFP transgenic mouse line in 
albino FVB/N background, and males were crossed with wild-type FVB/N females 
to produce litters that were 100% double heterozygous (iDct-GFP). One day after 
birth, the pups were irradiated with UV radiation (UVB, UVA, or sham)* 
(Supplementary Fig. 3). Dorsal skins were harvested at 24h and 6 days after 
irradiation (or sham irradiation). Twenty-four hours before mice were killed 
and dorsal skins harvested, neonates were injected intraperitoneally with doxycy- 
cline at 80 gg ' body weight to activate GFP expression. Skins from 6-8 pups ofa 
litter were pooled to prepare single-cell suspensions using a published protocol”, 
followed by isolation of GFP* melanocytes via FACS. RNA samples isolated from 
these melanocytes were subjected to cDNA microarray on Illumina Murine 
Beadchips v. 2.0 (Illumina). qRT-PCR validation was performed using primers 
listed in Supplementary Table 2. Immunohistochemical and flow cytometric ana- 
lyses were performed using antibodies listed in Supplementary Table 3. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Generation of Dct-rtT A and iDct-GFP mice. DNA fragments from the follow- 
ing plasmids were used to construct the plasmid pDct-rtTA-BGlo, which harbours 
the transgene fragment used to make the Dct-rtTA transgenic mice. Firstly, 
phBGlobin: the vector backbone is pBluescript II KS+ and the insert is a 
BamHI/PstI genomic fragment from the human B-globin gene, which is com- 
prised of partial exon 2, full intron 2, full exon 3 and poly A signal. Secondly, 
pPDct: this plasmid contains the 3.4-kb BamHI/Eco47III fragment from the Det 
gene promoter*'. Thirdly, pBS/IRES-M2: this plasmid contains the rtTA2s-M2 
fragment. It is a variant of pUHrT62-1°, and was obtained from C. Toniatti and H. 
Bujard. The Dct-rtTA transgenic mice were generated using standard microinjec- 
tion techniques in a FVB/N background strain. Dct-rtTA mice were bred with the 
TRE-H2B-GFP transgenic mice** to generate the bi-transgenic iDct-GFP mice. 

Mice. Wild-type FVB/N female breeders, Ccr2~'~ mice (strain B6.129S4- 
Cer2""7J), and Cer5/~ mice (strain B6.129P2-Cer5"”""*"“/J) were obtained 
from Jackson Laboratories and housed under the strict guidelines of the 
Institutional Animal Care and Use Committee (IACUC)-approved protocols. 
Transportation of mice between National Cancer Institute-Frederick and 
George Washington University Medical Center (GWUMC) animal facilities was 
performed by special animal courier service approved by IACUC. 

In vivo UV irradiation. iDct-GFP pups were irradiated with UV radiation (UVB, 
UVA, or sham) as previously described* (Supplementary Fig. 3). The standard 
erythemal dose (SED) is used to compare the sunburning effectiveness of different 
UV-emitting sources. These sources include UV-emitting lamps in suntanning 
beds, welder’s arcs and sunlight among others. By determining the SED one is able 
to compare how efficient the UV-emitting sources are at inducing sunburn or 
reddening in skin relative to each other. In experimental UV studies, many laborat- 
ories use different types of UV sources for a variety of experiments. Thus, deter- 
mining the SED allows one to produce equivalent amounts of ‘sunburning’ 
radiation regardless of the UV spectral output of the different sources used. The 
SED is produced by multiplying the spectral output or irradiance (W m * of the 
UV-emitting source) with the Commission Internationale de l’Eclairage (CIE) 
standard erythemal action spectrum (note that 1 W = 1 Js '). The CIE standard 
erythemal action spectrum is thus used to ‘weight’ the incoming UV radiation on 
an erythemal wavelength basis. The product curve produced from this convolution 
will give the ‘erythemal effective’ irradiance (Werythemal effective m *) upon integ- 
ration of the area under this curve. The erythemal effective irradiance is used to 
calculate the erythemal or sunburning dose by the equation: doseg:ythemal effective = 
irradianceerythemal effective * time (s). By definition, 1 SED = 100] m?. 

Thus, our UVB source, which consists of a UVB bandpass interference filter 
with a half-band width of +5 nm (dimensions: 5.1 cm X 5.1 cm) blocked to 10°? 
to 10 * outside the main bandpass (280-320 nm). This custom-made filter when 
coupled to our 2.5 kW Xenon arc allows for very clean wavelength resolution. The 
100 cm? exposure area is large enough to accommodate up to 12 neonates (1-3 
days old), which are situated in a three-chambered animal holder covered with a 
quartz lid that allows for UV transmission. Holes are drilled into the sides to allow 
for air exchange. This unit sits on top of a turntable rotating at approximately 
3 r.p.m. to average out beam uniformity. The irradiance is measured with a spec- 
troradiometer (StellarNet) and under standard conditions regularly produces an 
irradiance of approximately 0.30 CIE-effective Wm *. The exposure time is con- 
stant at 90min. These conditions regularly produce a CIE-effective dose of 
approximately 14-16 SED. This level of CIE-effective irradiance agrees well with 
our previous calculations using a radiation transfer algorithm over a global 
latitude/longitude grid at Northern or Southern summer mid-latitudes*’. Direct 
terrestrial measurements at these latitudes indicate that under summer sunlight 
conditions on clear days in the Northern or Southern Hemisphere between 11:00 
and 15:00 local time one can measure SEDs in the range of 10 to 20 depending on 
exposure conditions. As an example and as part ofa project to estimate the number 
of SEDs one might receive during the day using, in this case, cyclists training in 
Spain during summer and winter, a recent paper shows just such levels in good 
agreement with our measurements”. 

FACS. Single-cell suspensions from pooled skin samples were prepared using a 
published protocol’®. FACS was performed on a FACSAria flow cytometer (BD 
Biosciences) equipped with a fixed-alignment cuvette flow cell and a Coherent 
Sapphire solid state laser providing 13 mW of 488-nm excitation wavelength. 
Processing rates averaged about 5,000 events per second at 20 psi. Cells of interest 
were gated by a combination of forward and orthogonal light scatter, and GFP 
fluorescence was captured in the detector with a 530/30 bandpass filter. GFP- 
positive cells suspended in PBS plus 1% bovine serum albumin (BSA) were sorted 
as bulk populations into 12 X 75 mm polypropylene tubes. Dorsal skins from a 
combined litter of 6-8 pups gave a disaggregated cell suspension containing about 
30-50 X 10° cells. FACS resulted in collection of 1-3 X 10° GFP-positive mela- 
nocytes per litter. Following FACS collection of cells into PBS plus 1% BSA 
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solution, the cells were centrifuged at 4°C, and the pelleted cells were lysed in 
Trizol Reagent (Invitrogen) for isolation of RNA. Total RNA was isolated from 
cells by Trizol organic extraction followed by RNeasy Micro Kit (Qiagen Sciences) 
procedure. Skin macrophages were isolated by immunostaining with anti-F4/80- 
Alexa Fluor 647 antibody (BioLegend) 

Microarray analysis of gene expression. Three biological replicates for each group 
were used for CDNA microarray analyses. RNA was quantified fluorimetrically and 
assayed for integrity using the Agilent Bioanalyser (Agilent Technologies). One- 
hundred nanograms of RNA was converted to biotinylated cRNA using one round 
of amplification with the Illumina Labelling Kit (Illumina) and one round of T7 
polymerase amplification and hybridized to Illumina Murine Beadchips v. 2.0. After 
hybridization and staining, the arrays were scanned in an Illumina Bead Station, and 
the images processed using Illumina Bead Studio software. 

Statistical analysis of microarray data. The raw microarray data were extracted 
from the BeadStudio software and imported into the R statistical programming 
environment (R Development Core Team)*° and Bioconductor*’. A variance 
stabilization transformation® followed by quantile normalization and quality 
assessment were performed using the lumi package’*. Differential expression 
between groups was calculated using a linear model and empirical Bayes-moderated 
false discovery rates (FDR) for each treated versus untreated line were calculated 
using the limma package*’. A heatmap including probes showing differential 
expression between UVB-treated and untreated with a FDR of 0.05 was generated 
(Fig. 2a and Supplementary Fig. 7). Hierarchical clustering of genes was performed 
using single linkage and Pearson correlation distance metric. Samples were simply 
ordered naturally. 

qRT-PCR. The most highly significant genes identified through the statistical 
analysis of microarray data were validated by RT-PCR. Reverse transcription was 
performed on 0.1-1.0 pg RNA with Superscript III RT system (Invitrogen) fol- 
lowing the manufacturer’s protocol. Real-time PCR was performed with 
Quantitect SYBR Green PCR system (Qiagen Sciences) on a 7900HT Real-Time 
PCR machine (Applied Biosystems). 18S ribosomal RNA was used as the normalizer. 
The primers used in qRT-PCR are listed in Supplementary Table 2. 

Histology and immunohistochemistry. Skin and tumour samples were preserved 
in optimal cutting temperature (OCT) compound and stored frozen at — 80 °C. Five- 
micrometre sections were cut using a cryostat (Leica Microsystems) and were 
observed under a fluorescence microscope (Nikon Instruments) with DAPI 
(Vectashield, Vector Laboratories) counterstaining. The antibodies for fluorescence 
immunohistochemistry were obtained as listed in Supplementary Table 3. 
Haematoxylin and eosin stainings were performed using standard protocols. Select 
haematoxylin and eosin stained sections are shown in Supplementary Figure 26. 

In vivo antibody-mediated blockade. For the IFN-blockade experiment in pups, 
the anti-murine IFN-y neutralizing antibody was the rat IgG clone XMG-6 
(3.19 mg ml ')!°. The anti-murine IFN-oR1 antibody was the mouse IgG1 clone 
MARI-5A3 (purified monoclonal antibodies, 4 mg ml ~ 1)4! The control antibodies 
were rat IgG1 clone GL113 (1.52 mg ml) ascites. The in vivo applications of these 
antibodies have been described’*”’. 0.1-0.2 mg antibody was injected per pup 
intraperitoneally, 1-2 h before and 3 days after UVB irradiation. For IFN-y blockade 
in the tumorigenesis experiment, the anti-IFN-y antibody was XMG-6. The control 
antibody was rat IgG1 against horseradish peroxidase (HRPN; BioXcell). Antibodies 
were administered intraperitoneally as follows: 1 mg per mouse on days — 1, 0, 1, 3, 6; 
and 0.5mg per mouse on days 9, 12, and 15 after inoculation. Tumours were 
harvested at day 18. 

Immunofluorescence flow cytometry and FACS. The immunofluorescence- 
based flow cytometry and FACS sorting were done using antibodies listed in 
Supplementary Table 3, using standard procedures. 

In vitro UV irradiation. The melan-c melanocyte cell line (gift from V. Hearing) 
was irradiated with the Junggust Box, a hand-made enclosed module that houses 
two F20 sunlamps outputting 60% UVB and 38% UVA wavebands. A total UV 
dose of 175Jm 7 was given at a dose rate of 2.0 Wm *. The cells were irradiated 
in 10-cm culture dishes at 80% confluence overlaid with 3 ml PBS and the dish 
covered with plastic saran wrap to filter out any UVC. Following irradiation, the 
cells were incubated for the desired time in normal media (RPMI1640, 10% FBS, 
100,000 U1’ penicillin, 100 mg]! streptomycin sulphate, 2 mM glutamine and 
200 nM 12-O-tetradecanoylphorbol-13-acetate (TPA)). 

Chemotactic assays for Ccl8. The cDNA for mouse Ccl8 was amplified by RT- 
PCR and cloned into the pcDNA3.1 expression vector (Clontech). F5061 cells were 
transfected using FuGene HD (Roche). Empty pcDNA3.1 vector was transfected as 
control. In vitro chemotactic assay was performed on the RAW264.7 macrophage 
cell line in a Transwell system with 8-1m pore size (Corning). 1 X 10° cells were 
seeded in the top well, and the migrant cells were counted following haematoxylin 
staining. The experiment was done in duplicate. Five random microscopic fields 
from each replicate were counted for the number of migrant cells. For the in vivo 
chemotactic assay, 2.5 X 10° F5061-Ccl8 or F5061-vector cells were inoculated 


©2011 Macmillan Publishers Limited. All rights reserved 


LETTER 


subcutaneously in syngeneic FVB/N mice. Skin samples from the area of the 
inoculation were harvested two days after inoculation, preserved in OCT com- 
pound, cryosectioned, and immunohistochemistry with anti-F4/80 antibody was 
performed to assess macrophage infiltration. 

Tumorigenicity assay. F4/80° macrophages were FACS-isolated from dorsal skins 
of pups 6 days after UVB irradiation at PD1. Macrophages were admixed with F5061 
melanoma cells at a 1:5 ratio (0.5 X 10° macrophages: 2.5 * 10° F5061) and inocu- 
lated subcutaneously in syngeneic FVB/N mice (n = 10). Control mice were inocu- 
lated with 2.5 X 10° F5061 cells without macrophages (n = 10). Tumours were 
harvested after three weeks and were measured in three dimensions. Tumour 
volumes were calculated by the formula: V = length x width X depth X x X 1/6. 
For the tumorigenicity experiment with IFN-y blockade, for each of the two groups 
with macrophages, four mice had 0.5 X 10° macrophages but another 6 mice from 
each group had 0.42 X 10° macrophages mixed with 2.5 X 10° F5061 cells. No dif- 
ferences in tumour growth were detected between the mice with these differing 
numbers of macrophages. All four groups contained ten mice, but two mice from 
the F5061 plus control-antibody group died prematurely and were taken out of the 
analysis. 

TUNEL assay. The TUNEL assay was performed using TACS TdT Fluorescein kit 
(R&D Systems) following the manufacturer’s protocol. 

Human and mouse melanoma tissue microarrays. The human melanoma tissue 
microarray was obtained from S. Hewitt at the National Cancer Institute/National 
Institutes of Health. Antigen retrieval was performed by microwave boiling in citrate 
buffer (pH 6). Immunohistochemistry with anti-CD68 and anti-IFN-y antibodies 
(Supplementary Table 3) was performed serially with appropriate fluorescence- 
tagged secondary antibodies. The mouse melanoma tissue microarray has been 
generated by our laboratory and contains melanoma tissues from our HGF/SF 
transgenic mouse model, induced by a single neonatal UV radiation dose. Anti- 
F4/80 (macrophages), anti-CD3 (pan T cells) and anti-B220 (B cells) antibodies were 
used in conjunction with the Vectastain Elite ABC system (Vector Laboratories) 
following the manufacturer’s protocol. 

Statistical analyses. All statistical analyses were performed using Graphpad Prism 
software. Two-tailed Student’s t-test and ANOVA with post-hoc Tukey analyses 


were performed as indicated to validate significant differences. Means + s.e.m. are 
indicated for all statistical analyses. A P value of <0.05 was considered statistically 


significant. 
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Entanglement in a solid-state spin ensemble 


Stephanie Simmons', Richard M. Brown!, Helge Riemann’, Nikolai V. Abrosimov’, Peter Becker®, Hans-Joachim Pohl’, 


Mike L. W. Thewalt”, Kohei M. Itoh® & John J. L. Morton!” 


Entanglement is the quintessential quantum phenomenon. It is a 
necessary ingredient in most emerging quantum technologies, 
including quantum repeaters’, quantum information processing” 
and the strongest forms of quantum cryptography’. Spin ensembles, 
such as those used in liquid-state nuclear magnetic resonance*”, have 
been important for the development of quantum control methods. 
However, these demonstrations contain no entanglement and ulti- 
mately constitute classical simulations of quantum algorithms. Here 
we report the on-demand generation of entanglement between an 
ensemble of electron and nuclear spins in isotopically engineered, 
phosphorus-doped silicon. We combined high-field (3.4T), low- 
temperature (2.9 K) electron spin resonance with hyperpolarization 
of the *'P nuclear spin to obtain an initial state of sufficient purity 
to create a non-classical, inseparable state. The state was verified 
using density matrix tomography based on geometric phase gates, 
and had a fidelity of 98% relative to the ideal state at this field and 
temperature. The entanglement operation was performed simulta- 
neously, with high fidelity, on 10'° spin pairs; this fulfils one of the 
essential requirements for a silicon-based quantum information 
processor. 

Most quantum information processing algorithms applied to spin 
ensembles have been implemented in a regime of weak spin polariza- 
tion. However, owing to the very low purity of the states used, any 
exponential enhancement offered by quantum mechanics disappears 
when the scaling of total resources is considered. Highly mixed, or 
weakly initialized, ensembles are often interpreted as the sum of a 
perfectly mixed component (given by a normalized identity matrix 
in the density matrix representation) and a small amount, ¢, of a pure 
component, po; thus, Prue =(1—e)I/d+epy, where d is the dimen- 
sionality of the state. The I component is invariant under unitary 
operations and is not directly observable by magnetic resonance, 
which produces measurements of the population differences across 
allowed electron and nuclear spin transitions. It is therefore straight- 
forward to ignore the maximally mixed component: this approach is 
called the ‘pseudo-pure approximation’”®. 

There are a number of entanglement witnesses or monotones that 
can distinguish entangled states from (classical) separable ones. A 
widely used test is the positive partial transpose (PPT) criterion, which 
is both a necessary and a sufficient test of entanglement for two 
coupled, spin-1/2 particles’*. Applying this test to the mixed state 
above, Ptrue it can be shown® that the minimum value of ¢ which 
permits the overall state to be entangled is 1/3. 

Typical values for ¢ in liquid-state nuclear magnetic resonance and 
electron spin resonance (using 10-GHz excitation at a temperature of 
5K) are ~10~-° and ~ 107”, respectively. These values are well below 
the required threshold for the PPT test. Thus, although experiments 
performed in this regime provide a valuable test bed for techniques in 
entanglement generation and detection’, the states created are only 
pseudo-entangled, and are fully separable. (A notable exception was 
the use of chemical methods to generate highly polarized hydrogen spin 
pairs’, though that is a single-shot experiment with limited scalability.) 


To overcome this limit, we require states of higher initial purity and a 
method to measure the I component of the density matrix. 

We follow a hybrid approach, using both the electron spin and the 
nuclear spin associated with a phosphorus donor in silicon. Isolated 
donors in isotopically engineered semiconductors are of particular 
interest as they possess excellent decoherence characteristics (both 
the electron and the nuclear coherence times, T>, exceed seconds'’'”), 
can be controlled with high fidelity using microwave and radio- 
frequency pulses'*"’, and are promising for integrating quantum tech- 
nologies into conventional semiconductor devices"». 

Neglecting the weak polarization of the nuclear spin, the initial state 
populations are determined by the electron spin Zeeman energy, as 
shown in Fig. 1a, where « = exp(—gypB/kgT), gis the electron g-factor, 
[tp is the Bohr magneton, kg is Boltzmann’s constant, and B and T are 
the experimental magnetic field and temperature, respectively. At a 
high magnetic field (3.4T) and low temperature (2.9 K), the donor 
electron spin is thermally polarized to ~66%; however, the 31D nuclear 
spin, with a much weaker magnetic moment, has only ~0.04% polari- 
zation. Various methods, collectively known as dynamic nuclear 
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Figure 1 | Sequences for nuclear spin hyperpolarization and entanglement 
generation for this coupled S = 1/2, I= 1/2 spin system. a, The initial state is 
at thermal equilibrium, where populations (green) are distributed according to 
the electron spin (e) polarization at this magnetic field and temperature (see 
text). A pair of applied microwave and radio-frequency m pulses move spin 
populations to favour the \T) nuclear spin (n) state. After some time, twarr > 
Tie, there is a significant majority population in state |3), or |} |) (where the 
first and second arrows indicate the nuclear and electron spins, respectively). 
Nuclear spin and cross-relaxation processes occur on timescales much longer 
than T),. b, Illustration of the 28¢i-P coupled spin system. c, Starting from the 
hyperpolarized state in a, an electron spin coherence is generated and 
transformed into the final entangled state, containing a superposition of ||) 
and |||). d, The growth in the electron spin echo intensity measured on the 
|1)-|3) transition provides a measure of the population ratio, «. a.u., arbitrary 
units. e, This hyperpolarization sequence minimizes the linear entropy of the 
two-spin state for a given value of «. 
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polarization’*”’, exist for indirectly transferring electron spin polariza- 
tion to the nuclear spin and often exploit cross-relaxation processes 
involving simultaneous electron and nuclear spin flips. Here we exploit 
the relative absence of cross-relaxation leading to a substantial differ- 
ence in the relaxation times of the electron and nuclear spins’’, to 
hyperpolarize the nuclear spin rapidly and with high efficiency. This 
hyperpolarization process is similar to ‘algorithmic cooling’ methods, 
whereby a particular quantum bit (qubit) relaxes quickly owing to 
coupling to a heat bath’. 

Figure 1 illustrates our method for tackling the twin challenges of 
measuring and minimizing the I component in the density matrix of the 
coupled electron-nuclear spin system. The hyperpolarization of the 
nuclear spin can be understood as a SWAP operation (which inter- 
changes the states of two qubits) with the (thermally polarized) electron 
spin, using a combination of resonant microwave and radio-frequency 
tt pulses. This is followed by a delay tw arr, which is substantially longer 
than the electron spin relaxation time, T), (specifically, tw arr ~ 8Tie); 
during which the electron spin relaxes back to thermal equilibrium. On 
this timescale, other relaxation processes (such as pure nuclear spin flips 
or electron-nuclear spin flip-flops) are orders of magnitude slower and 
can be neglected. The resulting hyperpolarized state is 


= a (a1) (1| +. |2) (2| +[3) (3| + 14) (4) 


where Z = 2(1 + «) is a normalizing constant. 

Although spin echo sequences can only be used to probe the popu- 
lation differences across energy levels, we can obtain a direct measure 
of the population ratio, ~, by measuring the electron spin echo ampli- 
tude between levels |1) and |3) before and after the hyperpolarization 
sequence, as shown in Fig. 1d. Owing to the enhanced polarization of 
the nuclear spin, a spin echo measured on this transition increases by a 
factor of 2/(~% + 1) in comparison with the measurement from a fully 
relaxed thermal state. This measure is strictly conservative: it places a 
lower bound on the true polarization of the electron, as imperfections 
such as pulse errors or residual relaxation processes only lead to a 
lower apparent state purity. Using this measure, we observe an 
enhancement of the echo intensity by a factor of 1.643(2), correspond- 
ing to an upper bound of « = 0.217(2). 

Linear spin entropy (defined as V[1 —Tr(p*)]/(WV — 1) for an NV - 
dimensional Hilbert space) is a useful characterization of a state’s 
purity, and ranges from one, for maximally mixed states, to zero, for 
pure states. Our hyperpolarization sequence corresponds to a decrease 
in linear spin entropy, made possible by the open quantum system’s 
contact with the lattice heat bath (Fig. le). Importantly, this approach 
leads to the minimum possible linear entropy given the electron spin 
polarization resource and type of relaxation present’*. Entanglement is 
maximized in a mixed, two-qubit density matrix by first minimizing 
the linear entropy and then generating an entangled coherence across 
the levels with the largest and second-smallest populations'””°. Following 
this strategy, we create an entangled state using a coherence-generating 
microwave m'”/2 pulse (where the superscript denotes the pair of levels 
addressed by the pulse) followed by a radio-frequency n** pulse (Fig. 1c), 
yielding the target state: 


l+a 0 0 l1-a 
1 0 20 0 0 
P“2R 1 0 0 2m 0 
l-a« 0 O Il+a 


This density matrix is entangled according to the PPT criterion when 
a = 0.432; other preparation methods (such as pseudo-pure state pre- 
paration) require substantially higher polarization (Supplementary 
Information). 

Having prepared the initial state and performed an entangling 
operation, we now use density matrix tomography to extract the final 
two-spin state. Owing to the weak magnetic moment of nuclear spins 
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and necessarily low donor concentration in our sample, we are 
restricted to non-projective measurements of the electron spin 
ensemble along the o, and oy Pauli bases, which can be performed 
selectively on the m, state of the nuclear spin (in product operator 
formalism, these bases can be written as Sx I” BY 

Diagonal elements of the density matrix (corresponding to state 
populations) are obtained by mapping pairs of population differences 
into an electron spin echo on the |1)-|3) transition (S,., J”). The accur- 
ate detection of off-diagonal elements (coherences) is a more elaborate 
process, made by selectively labelling the coherence between each pair 
of eigenstates with a distinguishable, time-varying phase’. By this pro- 
cess, a particular phase accumulation rate provides the signature of a 
particular coherence, allowing the off-diagonal elements to be recon- 
structed from the amplitudes in the Fourier transform of a measured 
signal. 

Here we follow an approach inspired by the Aharonov-Anandan 
geometric phase gate*’”’ to apply arbitrary phases in a fixed time to the 
four different eigenstates, and thus separately label each of the possible 
coherences. We apply two 1 pulses, along different axes, across a 
transition between a pair of eigenstates. The phase acquired by each 
eigenstate is opposite and equal to half the solid angle of its trajectory 
on the Bloch sphere (Fig. 2a). Thus, applying 2)” followed by — m7, 
(subscripts denote pulse phase and, thus, nominal rotation axis) leads 
eigenstates |1) and |3) to assume trajectories of equal and opposite 
solid angle, +2¢. A similar operation, To followed by — ne, is 
applied to the nuclear spin transition, such that the total operator 
describing the action of these four pulses is 


e * 9 0 0 
0 1 0 0 
MOO=) 4 9 got) 9 


The value of ¢ is incremented by d¢ on each shot of the experiment, 
with effective frequency vy = 2n/d (and similarly for o, do and v,). 
We then map each off-diagonal element of the density matrix in turn 
into S,.,J” using a set of appropriate microwave and radio-frequency x 
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Figure 2 | Electron and nuclear spin phase rotations reveal the off-diagonal 
elements of the density matrix. a, Under the application of two consecutive 
r* pulses around different axes (#), the eigenstates |1) and |3) undergo closed 
trajectories on the Bloch sphere with equal and opposite solid angles, Q = +29. 
Each state picks up a phase equal to half this solid angle. b, This to, —™y phase 
gate is applied to both electron |1)-|3) and nuclear |3)-|4) transitions, where 
the two phases are varied by different increments, 5¢ and da, as the experiment 
is repeated. Example oscillations are shown for three experiments where we 
generate an electron coherence, |1)(3 |, a nuclear coherence, 4)(3| and a zero 
quantum coherence, |2){3|. c, Fourier transforms (FT) of the oscillations with 
respect to increment number show peaks located at the frequencies 0.050(8), 
0.031(5) and —0.079(8), in agreement with the frequencies that were set, 

Vg = 2n/d = 0.05 and v, = 2n/d0 = 0.03. 
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pulses, and measure the amplitude of the Fourier component at the 
effective frequency corresponding to that coherence. Quadrature mea- 
surement allows us to discriminate between positive and negative 
frequencies. The presence of other Fourier peaks would be illustrative 
of pulse errors in the mapping sequence, but as seen in Fig. 2b, c, such 
errors are negligible even in the absence of operations such as phase 
cycling. 

By combining our measurements of the identity component and the 
diagonal and off-diagonal elements of the density matrix of the electron- 
nuclear spin system, we obtain the following expression for p: 


0.382 0.003+0.000i —0.035—0.039i 0.272 
0.003 — 0.0007 0.017 —0.000+0.001i 0.001+0.003i 
—0.035+0.039i —0.000—0.001i 0.174 — 0.055 —0.042i 
0.272 0.001—0.003i —0.055+0.042i 0.427 


This state has a minimum eigenvalue under the PPT test of 
—0.19(1) and a concurrence, C, of 0.43(4), each of which confirms 
the presence of finite entanglement. The results of this tomography 
process are shown in Fig. 3. The fidelity of the measured density matrix 
with respect to the target state, given that « = 0.217, is 98.2(2)%, and is 
68(2)% with respect to an ideal Bell state (~ = 0). To obtain the uncer- 
tainty in these values, we used Monte Carlo generation of physical 
density matrices based on the standard error of each matrix element 
due to noise (Supplementary Information). 

The finite entanglement shown can offer direct advantages over 
classical methods in applications such as quantum sensors”. To 
achieve higher-purity entangled states, we could use lower tempera- 
tures; for example, we would expect C ~ 0.99 if these experiments were 
performed at 0.8K. Complementary to this approach, entanglement 
purification could be performed using a larger Hilbert space at each 
node™, for example using a donor atom with a higher nuclear spin 
(such as bismuth, with I = 9/2). 

The electron-nuclear spin entanglement generated here could also 
be mapped into an entangled state between nuclear spin pairs’. By 
interchanging (by SWAP) the state of the electron spin with a second, 
coupled nucleus, for example, nuclear spin entanglement could be 
attained in a regime where the thermal polarization of the nuclei would 
be orders of magnitude too small and the direct coupling between 
them weak. Clusters of up to eight nuclei coupled to a single electron 
spin have been explored in other materials”, although the scaling of 
such an approach seems limited. A scalable network of entangled 
nuclear spins could be generated by exploiting the ability to ionize 
the donor and transfer the electron onto a neighbouring donor site*””*. 
These operations, combined with single-shot read-out of the phos- 
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Figure 3 | Measuring an entangled density matrix. a, The full pulse sequence 
used to prepare, entangle and measure the two-spin state. The final read-out 
stage was changed according to the density matrix element being measured: 
examples are shown for the |1){2| and |1)(4| states. b, The obtained density 
matrix is shown as solid bars, and the dashed outline (zero where not shown) 
shows that of an ideal state given ~ = 0.217. The fidelity of the ideal state with 
the measured density matrix is 98%. 
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phorus donor spin” and globally controlled electron-nuclear spin 
entanglement such as we have demonstrated, form the basis for a 
cluster-state quantum computer in silicon”. 


METHODS SUMMARY 


Si:P consists of an electron spin, S = 1/2 (g = 1.9987), coupled to the nuclear spin, 
I= 1/2, of *'P through an isotropic hyperfine coupling of a = 4.19mT. The 
W-band electron spin resonance signal comprises two lines (one for each nuclear 
spin projection, M; = +1/2). Our experiments were performed on the low-field 
line of the electron spin resonance doublet, corresponding to M; = 1/2. At 2.9K 
and 3.36 T, the electron and nuclear spin relaxation times were measured to be 
approximately 0.6s and 100s, respectively. 

The sample consists of a **Si-enriched single crystal about 0.5 mm in diameter with 
a residual *’Si concentration of order 70 p.p.m., produced by decomposing isotopi- 
cally enriched silane in a recirculating reactor to produce poly-silicon rods, followed 
by floating-zone crystallization. Phosphorus doping of ~10'* cm” * was achieved by 
adding dilute PH; gas to the ambient argon during the final floating-zone single- 
crystal growth. Further information on the sample growth has been reported 
elsewhere”. 

Pulsed electron spin resonance experiments were performed using a W-band 
(94-GHz) Bruker ELEXSYS 680 spectrometer, modified to allow microwave phase 
control and equipped with a 6-T superconducting magnet and a low-temperature 
helium-flow cryostat (Oxford CF935). The cryostat was pumped to achieve a 
temperature of 2.88 K (internal thermocouple) consistent with the spin temper- 
ature measurement (see text). Typical pulse times were 56 ns for a microwave 7 
pulse and 100 ps for a radio-frequency nt pulse. To achieve arbitrary phase control, 
we generated radio-frequency pulses using a Rohde and Schwarz AFQ100B 
together with an Amplifier Research 500 W amplifier. 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature09712 


Branched tricarboxylic acid 
metabolism in Plasmodium 


falciparum 


Kellen L. Olszewski, Michael W. Mather, Joanne M. Morrisey, 
Benjamin A. Garcia, Akhil B. Vaidya, Joshua D. Rabinowitz 
& Manuel Llinas 


Nature 466, 774-778 (2010) 


The samples used for histone proteomics described in this Letter were 
inadvertently switched, such that the U-'*C-glucose and U-'?C-'°N- 
glutamine data were inverted. The plots in Fig. 2b and the spectra in 
Supplementary Fig. 3 have been modified to reflect this. The corrected 
results demonstrate that '*C-labelling of histone acetyl groups occurs 
only in cells grown on '*C-glucose and not on '*C-glutamine. Therefore, 
glucose is the primary source of the acetyl units used for both amino 
sugar biosynthesis and nuclear protein acetylation. Although U-'°C- 
'N-glutamine does give rise to labelled acetyl-CoA, its localization 
and function remain unclear. The model presented in Fig. 4 has been 
modified to reflect these facts, which do not alter the paper’s main 
conclusions about TCA cycle architecture. The corrected Figs 2b and 
4 are shown below. The authors apologize for this error. 


Supplementary Information is linked to the online version of the paper at 
www.nature.com/nature. 
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Modelling the long QT syndrome with induced 


pluripotent stem cells 


Ianit Itzhaki!*, Leonid Maizels!, [rit Huber’, Limor Zwi-Dantsis', Oren Caspi', Aaron Winterstern!, Oren Feldman’, 
Amira Gepstein’, Gil Arbel', Haim Hammerman’, Monther Boulos? & Lior Gepstein” 


The ability to generate patient-specific human induced pluripotent 
stem cells (iPSCs)'* offers a new paradigm for modelling human 
disease and for individualizing drug testing*. Congenital long QT 
syndrome (LQTS) is a familial arrhythmogenic syndrome charac- 
terized by abnormal ion channel function and sudden cardiac 
death*>’. Here we report the development of a patient/disease- 
specific human iPSC line from a patient with type-2 LQTS (which 
is due to the A614V missense mutation in the KCNH2 gene). The 
generated iPSCs were coaxed to differentiate into the cardiac line- 
age. Detailed whole-cell patch-clamp and extracellular multielec- 
trode recordings revealed significant prolongation of the action- 
potential duration in LQTS human iPSC-derived cardiomyocytes 
(the characteristic LQTS phenotype) when compared to healthy 
control cells. Voltage-clamp studies confirmed that this action- 
potential-duration prolongation stems from a significant reduction 
of the cardiac potassium current Ix,. Importantly, LQTS-derived 
cells also showed marked arrhythmogenicity, characterized by 
early-after depolarizations and triggered arrhythmias. We then 
used the LQTS human iPSC-derived cardiac-tissue model to evalu- 
ate the potency of existing and novel pharmacological agents that 
may either aggravate (potassium-channel blockers) or ameliorate 
(calcium-channel blockers, K,4-7p-channel openers and late sodium- 
channel blockers) the disease phenotype. Our study illustrates the 
ability of human iPSC technology to model the abnormal func- 
tional phenotype of an inherited cardiac disorder and to identify 
potential new therapeutic agents. As such, it represents a promising 
paradigm to study disease mechanisms, optimize patient care (per- 
sonalized medicine), and aid in the development of new therapies. 

Congenital LQTS*’, now classified into 12 subtypes, is a familial 
arrhythmogenic syndrome characterized by delayed repolarization, a 
prolonged QT interval in the electrocardiogram and a life-threatening 
polymorphic ventricular tachycardia known as torsade de pointes 
(TdP). Although heterologous expression systems® and animal models 
have provided important insights into LQTS pathogenesis, the lack of 
in vitro sources for human cardiomyocytes and the inability to model 
patient-specific disease variations has significantly hampered the study 
of this disease. Here we propose that cardiomyocytes derived from 
LQTS patient-specific human iPSC lines may recapitulate the disease 
phenotype in vitro, providing important mechanistic insights and 
offering a unique platform to evaluate patient-specific disease aggra- 
vators and therapies (Supplementary Fig. 1). 

Dermal fibroblasts were obtained from a 28-year-old woman with a 
diagnosis of familial type-2 LQTS (Fig. 1a) due to a missense mutation in 
exon 9 of the KCNH2 gene (A614V mutation caused by a CT nucleo- 
tide substitution)*. This mutation was previously shown in heterologous 
expression systems to affect the pore-forming region of the KCNH2 (also 
known as HERG)-encoded potassium channel (Supplementary Fig. 1); 
leading to a significant reduction of the rapid component of the delayed- 
rectifier potassium current (Ix,) responsible for type-2 LQTS”. 


The cultured fibroblasts were reprogrammed to generate LQTS 
patient-specific human iPSC clones (Fig. 1b) after transduction with 
retroviral vectors encoding for SOX2, KLF4 and OCT4. Several clones 
were generated, three of which were continuously propagated and used 
for cardiomyocyte differentiation and characterization. Similar control 
human iPSCs were created from fibroblasts from a healthy individual. 
Importantly, the A614V heterozygous mutation was identified in 
LQTS human iPSCs (Fig. 1a) but not in control iPSCs. 

All iPSC clones generated showed characteristic human embryonic 
stem (ES) cell morphology (Fig. 1b and Supplementary Fig. 2), 
expressed the pluripotency markers NANOG, SSEA4, OCT4 and 
TRA-1-60 (Fig. 1b and Supplementary Fig. 2), had alkaline phospho- 
tase activity (Fig. 1b and Supplementary Fig. 3A), and maintained a 
normal karyotype (Fig. 1c and Supplementary Fig. 3B). All clones 
showed silencing of the three retroviral transgenes (Supplementary 
Fig. 4), reactivation of endogenous pluripotency genes (OCT4, SOX2, 
NANOG, FOXD3 and ZFP42 (also known as REX1); Supplementary 
Fig. 5), and demethylation of the NANOG promoter regions (Fig. 1d); 
all indicating successful reprogramming. Pluripotency of the human 
iPSC clones was confirmed by the presence of cell derivatives of all 
three germ layers in in-vitro-differentiating embryoid bodies (Fig. le 
and Supplementary Fig. 6) and by teratoma formation after injection of 
undifferentiated human iPSCs into immunocompromised NOD/ 
SCID mice (Fig. 1f and Supplementary Fig. 7). 

Next we used the embryoid-body-differentiation system to coax 
human iPSC differentiation into the cardiac lineage. Similarly to human 
ES cells’®”’ and healthy human iPSCs'*”, this resulted in the appear- 
ance of spontaneously beating areas within the differentiating embryoid 
bodies (Supplementary Movies 1 and 2). Both the control and LQTS 
human iPSC lines showed comparable cardiomyocyte differentia- 
tion capacities (Supplementary Fig. 8). Immunocytostaining studies 
showed positive staining of the control (Supplementary Fig. 9A) and 
LQTS human iPSC-derived cardiomyocytes (iPSC-CMs) for the sarco- 
meric proteins troponin I (TnI) and «-actinin and for the gap-junction 
protein connexion 43 (Fig. 2a). Similarly, gene expression analysis con- 
firmed the expression of cardiac-specific transcription factors (NKX2- 
5) and structural genes (MLC2V (also known as MYL2), MYH6 and 
MYH/7) by both cell types, including of the relevant KCNH2 gene 
(Fig. 2b). Importantly, both control and LQTS iPSC-CMs showed 
cardiac-specific functional properties. This was manifested by the 
development of action potentials at the cellular level (Fig. 3a), a func- 
tional syncytium at the multicellular level (Fig. 2c and Supplementary 
Fig. 9B) and appropriate chronotropic responses to the f-adrenergic 
agonist isoproterenol (Fig. 2d and Supplementary Fig. 9C). 

Similar to previous reports in human ES cells’? and human iPSCs”, 
three types of action-potential morphologies were recorded from con- 
trol and LQTS iPSC-CMs (Fig. 3a and Supplementary Table 1). The 
most dominant form was the ‘ventricular-like’ morphology (charac- 
terized by a significant plateau phase), with the ‘atrial-like’ (showing 
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Figure 1 | Establishment and characterization of type-2 LQTS human iPSCs 
(clone 1). a, Left panel shows ECGs from the type-2 LQTS patient during sinus 
rhythm (QT interval corrected for heart rate (QTc) = 520 ms) and during 
initiation and sustainment of TdP. PVC, premature ventricular contraction. 
Right panel shows sequencing of the KCNH2 gene identifying the A614V 
heterozygous missense mutation in LQTS human iPSCs. b, Reprogramming of 
the patient’s fibroblasts into LQTS human iPSCs. Shown from left to right are a 
typical human iPSC colony, positive immunostaining for pluripotency markers 
(OCT4, NANOG, SSEA4 and TRA-1-60), and positive staining for alkaline 


triangular-shaped action potentials) and ‘nodal-like’ (showing rela- 
tively depolarized resting membrane potential) types being less pre- 
valent. More importantly, these intracellular recordings revealed 
marked action-potential duration (APD) prolongation in LQTS 
iPSC-CMs (the characteristic LOTS electrophysiological signature*’) 
when compared to control cells (Fig. 3a, b and Supplementary Table 1). 
APD prolongation and the associated reduced repolarization velocity 
were noted in both ventricular-like and atrial-like LQTS human iPSC- 
CMs but not in the nodal-like cells. 

Because APD is rate-dependent, next we electrically stimulated the 
human iPSC-CMs and noted that the differences in APD values 
between LQTS and control cells persisted also at fixed rates (0.5 and 
1 Hz; Fig. 3b and Supplementary Fig. 10). In addition, comparable 
prolonged APD values were noted in ventricular myocytes derived 
from three different independent LQTS iPSC clones, which were all 
significantly longer than those obtained from two different control 
human iPSC lines (Supplementary Fig. 11A). Lastly, the LQTS pheno- 
type could be recapitulated in control iPSC-CMs by pharmacologically 
blocking the Ix, current with the specific blocker E-4031 (Supplemen- 
tary Fig. 12). 

Next we performed single-cell voltage-clamp studies and identified 
the presence of an E4031-sensitive current (Ix,) in control human iPSC- 
CMs (Fig. 3c), which resembled previous Ix, recordings from explanted 
human ventricular myocytes’. Similar recordings in the LQTS iPSC- 
CMs revealed significant reductions of this current (Fig. 3c). Thus, peak 
amplitudes of the Ix, activation currents in LQTS cardiomyocytes 
(n = 6), measured at physiologically relevant depolarization steps (0, 
20 mV), were found to be significantly smaller (by 72 and 60%, respect- 
ively) than those recorded in control cells (P<0.05; Fig. 3d, left). 
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phosphatase (AP) activity. Scale bars, 100 tm. c, Karyotype analysis of LQTS 
human iPSCs. d, Bisulphite sequencing analysis of the NANOG promoter in 
the patient’s fibroblasts and LQTS human iPSCs. e, Immunostaining of in vitro 
differentiating embryoid bodies for nestin (ectoderm), «-fetoproptein (AFP, 
endoderm) and desmin (mesoderm). Scale bars, 50 um. f, Teratoma formation 
following injection of undifferentiated LQTS human iPSCs in NOD/SCID 
mice. Note the formation of pigmented epithelium (ectoderm), gastrointestinal 
epithelium (endoderm) and hyaline cartilage (mesoderm), as identified by the 
arrows. Scale bars, 200 ttm. 


Similarly, peak amplitudes of the Ix, tail currents measured at 20 and 
60mV were also significantly smaller (by 64 and 68%, respectively, 
P< 0.05; Fig. 3d, right) than in control cells. These results are consistent 
with previous recordings in heterologous expression systems, where 
co-injection of wild-type along with mutant A614V cRNA suppressed 
the HERG current in a dominant-negative manner’. 

To evaluate the electrophysiological properties also at the multi- 
cellular level, we studied the human iPSC-derived cardiac tissue with a 
microelectrode array mapping technique’. The recorded extracellular 
electrograms were analysed to measure the field-potential duration 
(FPD; Fig. 2e). This measurement, which was normalized to account 
for variations in beating frequency’® (corrected FPD, cFPD), is analogous 
to the QT interval in the electrocardiogram (ECG) and was previously 
shown to correlate with APD and to predict drug effects on repolariza- 
tion’*”’. Similar to patch-clamp studies, cFPD (or rate-matched FPD) 
measurements were significantly longer in cardiac tissues derived from all 
three LQTS human iPSC clones when compared to healthy control speci- 
mens (Fig. 2e, f, Supplementary Fig. 11B and Supplementary Movie 1). 

Importantly, LQTS iPSC-CMs showed marked arrhythmogenicity. 
This was manifested by the development of early-after depolarizations 
(EADs) in both the atrial-like and ventricular-like LQTS iPSC-CMs 
(Fig. 3e). EADs are spontaneous membrane depolarizations regarded 
as the harbinger of ventricular arrhythmias in LQTS’, and are thought 
to result from late membrane-inward currents, such as the L-type Ca?* 
‘window current'®. When reaching threshold, EADs may give rise to 
triggered action potentials, manifested in LQTS iPSC-CMs as isolated 
premature beats or even multiple sequential triggered action potentials 
(Fig. 3f, g). The presence of EADs was observed in 38 of the 58 LQTS 
iPSC-CMs studied (66%). These EADs gave rise to a single premature 
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Figure 2 | Phenotypic characterization of LQTS human iPSC-derived 
cardiac-tissue. a, Immunostaining of LQTS iPSC-CMs for a-actinin and TnI 
(top). Gap junctions (arrows) were identified by the punctuated connexion 43 
(Cx43) immunosignal (red) at the intercellular junctions (differential 
interference contrast microscopy (DIC) image, bottom right) between the 
cardiomyocytes (identified by «-actinin staining in green, bottom left). Scale 
bars, 30 um. b, Semi-quantitative polymerase chain reaction with reverse 
transcription (RT-PCR) demonstrating expression of cardiac-specific 
transcription factors (NKX2-5), sarcomeric proteins (MYH6, MYH7, MLC2V) 
and ion channels (KCNH2) by LQTS and control human iPSC-CMs in 
comparison to undifferentiated human iPSCs expressing OCT4 and NANOG. 
c, Microelectrode array extracellular recordings (middle) from LQTS human 


beat in 36% of the cells and in 28% also to sustained triggered activity. 
In contrast, recordings from control iPSC-CMs (n = 31) failed to show 
any EADs or any other arrhythmogenic activity. 

The arrhythmogenic activity of the LQTS iPSC-CMs was also 
identified at the multicellular level and ranged from isolated pre- 
mature beats (Fig. 2g, top, and Supplementary Movie 2) to more 
sustained activity (Fig. 2g, bottom). Consequently, ectopic activity 
in the microelectrode array recordings was noted in 38% (26 of 84) 
of the LQTS human iPSC-derived cardiac tissues studied but only in 
6% (3 of 47) of control samples. This ectopic activity was consistent 
with a triggered activity mechanism (although re-entry and automati- 
city could not be ruled out completely). 

Next we used the LOTS human iPSC model to evaluate drugs that 
may either ameliorate or aggravate the disease phenotype’’. Because 
malignant arrhythmias in LQTS patients are often precipitated by 
drugs that block Ix, we initially evaluated the specific Ix, blocker 
E-4031 (500nM), and noted significant APD prolongation (by 
47 +2%, P<0.01, n=7) in the LQTS iPSC-CMs (Supplementary 
Fig. 12B). This APD prolongation was coupled with increased 
arrhythmogenesis in 6 of the 7 cells studied, manifested either by 
the development of new EADs or by an increase in their number 
and complexity (Supplementary Fig. 12B). Similarly, Ix, blockade 
by E-4031 or by cisapride (a gastric prokinetic agent previously 
removed from the market because of its indirect Ix, blocking activity, 
which led to increased pro-arrhythmic mortality”’) significantly pro- 
longed cFPD and increased arrhythmogenicity also at the multi- 
cellular level (Supplementary Fig. 12C, D). These results, together 
with the Ix, current analysis, provide a measure of the magnitude 
of the dominant-negative effect induced by the A614V mutation. 
They also offer clues to the potential susceptibility of such LQTS 
patients to TdP in response to agents that may further inhibit the 
remaining Ix, current. More broadly it highlights the potential role of 
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iPSC cardiac tissue (top). The resulting colour-coded activation map (right) 
shows electrical propagation from the pacemaker area (red) to latest activation 
sites (blue). d, Positive chrontropic responses of LQTS hiPSCs-CMs (n = 16) to 
isoproterenol (101M). *P < 0.01 when compared to baseline. b.p.m., beats per 
minute. e, Extracellular recordings from control (top) and LQTS (bottom) 
human iPSC-derived cardiac tissues and their respective FPD measurements. 
f, Summary of rate-matched FPD (n = 12) and cFPD values (using Bazett’s (B), 
Hodges’ (H) or Fridericia’s'® (Fr) corrections) in control (n = 23) and LQTS 
human iPSC (n = 45) cardiac tissues. Error bars show s.e.m. *P < 0.05 and 
**P << 0.01. g, Arrhythmogenicity in LQTS cardiac tissue identified as single 
(top) or multiple (bottom, arrows) premature beats. 


the human iPSC model for safety pharmacology”, both for drug 
development and for patient-specific safety screening. 

Next we used the LQTS human iPSC-CM model to study agents that 
may possess novel therapeutic benefits for LQTS, through either APD 
shortening or by direct EAD suppression”’. The effect of these agents 
was also evaluated in healthy control cells (Supplementary Fig. 13). 
Because Ca** influx through L-type Ca** channels contributes 
importantly to APD and also has a role in EAD formation’, we pro- 
posed that inhibition of this current may be anti-arrhythmic’’. 
Application of a Ca* -channel blocker (nifedipine, 1 1M) led to sig- 
nificant APD abbreviation in LQTS cardiomyocytes (Fig. 4a, b, top), 
with mean APD, (the time interval required to reach 90% of 
repolarization) shortened by 57 + 7% (P< 0.01). More importantly, 
nifedipine application resulted in complete elimination of all EADs 
and triggered beats (Fig. 4c; n = 11). Similarly, nifedipine (1 uM) sig- 
nificantly shortened cFPD (Fig. 4a; P< 0.01, n = 5) and abolished all 
arrhythmic activity in the LQTS cardiac-tissue model (Fig. 4d). Long- 
term application of this agent, however, was associated with cessation 
of beating in some embryoid bodies. 

An alternative strategy to favourably alter the balance between inward 
and outward currents during repolarization in the LQTS may be to 
augment outward potassium currents. To this end we evaluated the 
effects of pinacidil (1 1M), a Kayp-channel opener, and noted significant 
shortening of both APDoy (Fig. 4a, b; P<0.05, n=4) and cFPD 
(Fig. 4a; P< 0.05, n = 9) in LQTS iPSC-CMs. Importantly, pinacidil 
application completely abolished all EADs and triggered arrhythmias 
in all LOTS iPSC-CMs studied (Fig. 4e, f; n = 7). A potential short- 
coming of the aforementioned therapies may be excessive APD short- 
ening leading to the potentially pro-arrhythmic ‘short QT syndrome’. 
In this respect, the human iPSC model may prove useful, as it may 
allow the identification of compounds and optimization of their dosage 
to result in APD normalization without pathological shortening. 
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Figure 3 | Whole-cell patch-clamp recording from human iPSC-CMs. 

a, Action-potential recordings from control and LQTS human iPSC-CMs 
showing ventricular-like, atrial-like and nodal-like morphologies. Notice the 
marked APD prolongation in both ventricular-like and atrial-like LQTS iPSC- 
CMs. b, APDop, APD79; and APDs9 values in LQTS (red) and healthy (blue) 
human iPSC-CMs. *P < 0.01 when compared to control cells. Error bars show 
s.e.m. ¢, Voltage-clamp recordings of the Jx, current, measured from control 
(left) and LQTS (right) human iPSC-derived ventricular cardiomyocytes. Top, 
baseline recordings. Middle, recording following administration of E-4031 


Lastly, we also evaluated the potential anti-arrhythmic efficacy of 
the late Na* -channel blocker, ranolazine”". Although, this agent may 
theoretically be more applicable for type-3 LQTS (resulting from gain-of- 
function Na‘ -channel mutations), a recent study showed its beneficial 
effects also in a canine model of acquired type-2 LQTS”. Interestingly, 
ranolazine application (15-50 uM) did not significantly alter APDoo or 
cFPD in LQTS cardiomyocytes (Fig. 4a and Supplementary Fig. 14; 
n= 8), probably because of its nonspecific blocking effect on different 
ion channels. Nevertheless, ranolazine induced a pronounced anti- 
arrhythmic effect at both the cellular and multicellular levels (Fig. 4g, h 
and Supplementary Fig. 14). 

Taken together, our data demonstrate the ability of human iPSC 
technology to model the abnormal functional phenotype ofan inherited 
cardiac disorder. Using this approach, the disease phenotype presented 
by type-2 LQTS patients at the bedside (long QT interval and the 
development of TdP) could be recognized as a measurable anomalous 
electrophysiological signature in bench studies using patient-specific 
human iPSC-CMs. Our study adds to a recent report demonstrating 
the ability to model type-1 LQTS using human iPSCs**. Importantly, 
the current study provides mechanistic insights into the pathogenesis of 
arrhythmias in this syndrome (by demonstrating the development of 
spontaneous EADs leading to triggered activity at both the cellular and 
multicellular levels). Moreover, it also highlights the potential of human 
iPSCs in the emerging field of personalized medicine by demonstrating 
the ability to screen the effects of potential disease aggravators and novel 
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(1 uM). Bottom, E-4031-sensitive current (Ix,) defined by digital subtraction of 
the two currents. d, Summary of the Jx, activation and tail-current amplitudes 
measured following test depolarization pulses of 0, 20 and 60 mV. Note the 
significant Ix, attenuation in LQTS cells (red, n = 6) when compared to 
controls (blue, n = 6). Error bars show s.e.m. e, Development of EADs in LQTS 
ventricular-like and atrial-like cells. f, g, Development of triggered activity in 
LQTS cardiomyocytes, manifested as EADs or even single and multiple 

(g) triggered beats. 


customized treatment options (which in LQTS include Ca”* -channel 
blockers, K,rp-channel openers, and late Na* -channel blockers). 

The results of the current study, focusing on congenital type-2 
LQTS, may have broader implications, as the specific current affected 
(Ixy) is thought to have an important role in the more commonly 
acquired LQTS forms (resulting from heart failure, cardiac hyper- 
trophy, or drug therapy). Moreover, the concepts described here 
may also be extended to model several other human genetic disorders, 
enabling translational research into disease mechanisms and therapies. 


METHODS SUMMARY 

Human iPSC generation and cardiomyocyte differentiation. Dermal fibroblasts 
were reprogrammed to generate human iPSC clones by transduction with retroviral 
vectors encoding SOX2, KLF4 and OCT4, followed by valproic acid treatment”. 
The undifferentiated human iPSC colonies were propagated on a MEF feeder layer. 
To induce differentiation, human iPSCs were dispersed into small cell clumps and 
cultivated in suspension for 10 days as embryoid bodies and then plated’>”. 
Immunostainings. Cells were fixed with 4% paraformaldehyde, permeabilized with 
1% Triton, blocked with 5% horse serum, and incubated with primary antibodies (as 
listed in Methods). The preparations were incubated with secondary antibodies and 
counterstained with 4’,6-diamidino-2-phenylindole (DAPI) for visualization. 
Whole-cell patch-clamp recordings. Dispersed human iPSC-CMs, obtained by 
enzymatic dissociation (collagenase, 1mgml~') of contracting embryoid bodies, 
were attached to fibronectin-coated glass coverslips. Action potentials were recorded 
at 32 °C in the current-clamp mode. A custom-written Matlab program was used 
to analyse action-potential parameters. Voltage-clamp studies were performed to 
characterize the Ix, current, defined as the E-4031 (1 LM) sensitive current. These 
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currents were elicited by 3-s depolarizing steps from a holding potential of —60 mV 
to potentials ranging from —40 to +60 mV in 10 mV increments. This was followed 
by a 2-s repolarization phase to —30 mV to elicit the tail current. 
Microelectrode array recordings. Extracellular electrograms, recorded by a high- 
resolution microelectrode array recording system (multichannel systems)’°, were 
used to determine FPD values in LQTS and control human iPSC-derived cardiac 
tissues. FPD was defined as the time interval between the initial field-potential 
deflection and its return to baseline. FPD measurements were either compared in 
rate-matched specimens or normalized (cFPD) for rate using Bazett’s, Hodges’ or 
Fridericia’s'® correction formulas. 

Statistical analysis. Results are reported as mean + s.e.m. Comparisons between 
LQTS human iPSC-CMs and healthy human iPSC-CMs groups were performed 
using the unpaired Student's t-test. Drug effects were analysed by paired t-test. 
One-way ANOVA followed by Tukey’s post-hoc tests were used when comparing 
multiple groups. P< 0.05 was considered statistically significant. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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cFPD with Bazett’s correction values following 
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Karp-channel opener pinacidil and late Na*- 
channel blocker ranolazine. *P < 0.05 when mean 
absolute values are compared to baseline. Error 
bars show s.e.m. b, APD shortening induced by 
nifedipine and pinacidil in LQTS iPSC-CMs. 
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(g, h). Arrows show the ectopic activity at baseline. 
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METHODS 


Establishment and cardiomyocyte differentiation of patient-derived iPSCs. 
Patient-derived human iPSC clones were established from the patient’s skin fibro- 
blasts by retroviral delivery of three reprogramming factors (SOX2, KLF4 and 
OCT4), followed by application of the histone deacetylase inhibitor valproic acid 
(VPA)™. Briefly, Moloney-based retroviral vectors (pMXs) containing human 
complementary DNAs (cDNAs) of OCT4, SOX2 and KLF4 (Addgene plasmids 
17964, 17218 and 17219, respectively) were used for retrovirus particle production. 
These plasmids were cotransfected with the helper plasmid encoding VSVG into 
HEK-293GP cells for virus production. Virus-containing media were collected at 
48 and 72h after transfection and used for two 24-h rounds of infection of the 
fibroblasts. Cells were then re-plated at a density of 1-2 X 10° cells per well on a 
MEF feeder layer, cultured in ES medium and treated with 0.9 mM VPA for 14d. 
Several human iPSC clones, which morphologically resembled human ES cells 
and were positively stained with vital TRA-1-81 or TRA-1-60 staining”, were 
selected and expanded for further characterization. The undifferentiated human 
iPSC colonies were cultured on mitotically inactivated mouse embryonic fibro- 
blast (MEF) cells as previously described'?*”°. Y-27632 (5 1M), a ROCK inhibitor, 
was added in early passages to increase seeding efficiency. The culture medium 
consisted of 80% knockout high-glucose glutamine-free DMEM with sodium 
pyruvate supplemented with 20% serum replacement, 1mM _ 1-glutamine, 
0.1mM mercaptoethanol, 4ng ml’ human recombinant basic fibroblast growth 
factor (bFGF), and 1%-nonessential amino acid stock (Invitrogen). 
Cardiomyocyte differentiation was induced using the embryoid body differenti- 
ating system as previously described in detail’*”®. Undifferentiated human iPSCs 
were dispersed into small cell clumps using 300Uml ' of collagenase IV 
(Worthington) for 45 min. These cells were then cultivated in suspension for 
10 days in the differentiation medium (knockout DMEM supplemented with 
20% FBS (Hyclone), nonessential amino acids, §-mercaptoethanol and glutamine) 
where they aggregated to form embryoid bodies. The embryoid bodies were then 
plated on 0.1% gelatin-coated culture dishes, cultured with the same differenti- 
ation medium, and examined daily for the appearance of spontaneous contrac- 
tions. At 30 days after plating, contracting areas within the embryoid bodies were 
carefully microdissected and used for the phenotypic characterization studies. 
We studied cardiomyocytes derived from three different LQTS human iPSC 
clones (generated independently from the patient’s fibroblasts). They were com- 
pared with cardiomyocytes derived from a healthy control human iPSC line 
generated using the same methods. We also used a second control human iPSC 
line that was well-characterized previously'*””. 
Genomic sequencing. Genomic DNA was isolated from the patient-derived human 
iPSC colonies using the high-pure PCR template preparation kit (Roche). The rel- 
evant DNA fragment of the KCNH2 gene was amplified by PCR reaction using 
100 ng genomic DNA (primer sequences are detailed in Supplementary Table 2). 
PCR products were then sequenced. 
Immunofluresence and alkaline phosphatase staining. Colonies of undifferenti- 
ated human iPSCs and single cardiomyocytes obtained by dissociating beating embry- 
oid bodies were fixed with 4% paraformaldehyde, permeabilized with 1% Triton 
(Sigma) and blocked with 5% horse serum. Specimens were incubated overnight at 
4°C with primary antibodies targeting OCT4, Tra-1-60, CX43 (from Santa Cruz), 
NANOG (Peprotech), SSEA4 (R&D), nestin (Millipore), «-fetoprotein (Cell Marque), 
desmin (Thermo), troponin I (TnI; Chemicon) and sarcomeric «-actinin (Sigma). The 
preparations were incubated with secondary antibodies at a dilution of 1:200 for 1h. 
Nuclei were counterstained with DAPI (1:500, Sigma). Preparations were examined 
using a laser-scanning confocal microscope (Zeiss LSM-510-PASCAL). Alkaline 
phosphatase activity was detected in live cultures using the alkaline phosphatase 
detection kit (Sigma) according to the manufacturer’s instructions. 
Teratoma formation. Undifferentiated human iPSCs were obtained by collagenase 
IV dissociation and injected subcutaneously into NOD/SCID mice. Palpable 
tumours were observed 4-6 weeks after injection. Tumour samples were collected 
at 6-8 weeks, cryo-preserved, processed by 10-t1m cryo-sectioning, and stained with 
haematoxylin and eosin. 
Karyotype analysis. Karyotype analysis was performed using standard G-banding 
chromosome analysis by the institution’s cytogenetic laboratory according to 
standard procedures. 
Bisulphite sequencing. Genomic DNA (1 1g) was bisulphite converted with the 
Methylamp DNA Modification kit (Epigenetek) according to the manufacturer's 
instructions. After bisulphite conversion, DNA was amplified using Faststart Taq 
polymerase (Roche) as follows: 94 °C for 4 min; 5 cycles of 94 °C for 30 s, 59 °C for 
3 min, 72°C for 3 min; 35 cycles of 94°C for 30s, 62°C for 30s, 72°C for 30s; 
72°C for 10 min. Primer sequences can be found in Supplementary Table 2. PCR 
products were TA-cloned into pTZ57R/T plasmid (Fermentas). Inserts were 
sequenced with M13 universal primers. 


Gene expression analysis. Undifferentiated human iPSCs and differentiating 
embryoid bodies were frozen in liquid nitrogen. RNA was isolated using the 
RNeasy-plus mini-kit (Qiagen-Gm). Reverse transcription into cDNA was con- 
ducted using the high-capacity cDNA reverse transcription kit (Applied 
Biosystems). The PCR-related primers are detailed in Supplementary Table 2. 
Briefly, each RT-PCR included the following PCR program: 3 min at 93°C, 30s 
at 93 °C, 30 s at 60 °C, and 30s at 72 °C. 2.5 ng of CDNA was used from each sample. 

SYBR-Green real-time PCR studies were performed using the Fast SYBR Green 

Master mix (Applied Biosystems) and primers (detailed in Supplementary Table 
3). All real-time PCR experiments were conducted in triplicates. Samples were 
cycled 40 times using a Fast ABI-7500 Sequence Detector (Applied Biosystems). 
ABI-7500 cycle conditions were as follows: 2 min at 50°C, 15 min at 95 °C fol- 
lowed by 40 cycles of 15s at 95 °C, 30s at 60°C and 30s at 72 °C. Cycle threshold 
(CT) was calculated under default settings for real-time sequence detection soft- 
ware (Applied Biosystems). 
Whole-cell patch-clamp recordings. Spontaneously contracting embyoid bodies 
were mechanically isolated, enzymatically dispersed into single cells (1mgml ! 
collagenase B; Roche) and attached to fibronectin-coated glass coverslips. Action 
potentials were recorded from spontaneously contracting small clusters super- 
fused with Tyrode solution at 32°C under spontaneous and paced (0.5-1 Hz) 
rhythms. The Tyrode solution consisted of NaCl (140mM), KCl (5.4mM), 
CaCl, (1.8mM), MgCl, (1 mM), HEPES (10 mM) and glucose (10 mM), pH 7.4. 
The pipette solution consisted of KCl (120 mM), MgCl (1 mM), Mg-ATP (3 mM), 
HEPES (10 mM) and EGTA (10 mM), pH, 7.2. Action potentials were recorded 
using the current-clamp mode using Axopatch 200B, Digidata 1322A, and pClamp 
9 (Axon Instruments) for data amplification, acquisition and analysis, respectively. 
A custom-written computer program was used to measure APDs59, APD79 and 
APDogp (the time intervals required to reach 50%, 70% and 90% of repolarization), 
dV/dt max» repolarization velocity and action-potential amplitude. 

For voltage-clamp studies of Ix, external Na* was replaced by equimolar choline 
(126 mM) and the solution was supplemented by 4-AP (5 mM), BaCl, (0.5 mM), 
CdCl, (0.2 mM) and chromanol (10 11M) to suppress potential interference of Ix, 
Thos Txt Ica and Ix,, respectively. Ix, was defined as the E-4031-sensitive (1 1M) 
current and was elicited by 3-s depolarizing steps from a holding potential of 
—60 mV to potentials ranging from —40 to +60 mV in 10 mV increments. This 
was followed by a 2-s repolarization phase to —30 mV to elicit tail current. 
Microelectrode array recordings. A high-resolution microelectrode array 
recording system (Multichannel Systems)”* was used to characterize the electro- 
physiological properties of human iPSC-CMs. The contracting areas within the 
embryoid bodies were microdissected and plated on fibronectin-coated micro- 
electrode array plates. The recorded extracellular electrograms were used to deter- 
mine local FPD, defined as the time interval between the initial deflection of the 
field potential to its return to baseline. FPD measurements were either compared 
in rate-matched specimens or were normalized (cFPD) to the activation rate using 
Bazett’s, Hodges’ or Fridericia’s'® correction formulae. For Bazett’s correction, 
cFPD-B = FPD/(RR)'””; for Hodges’ correction, cFPD-H = FPD + 0.105 x 
((1/RR) — 1); for Fridericia’s correction, CFPD-Fr = FPD/(RR)"”, where RR indi- 
cates the time interval (in seconds) between two consecutive beats. 

To assess for the effects of different drugs on the electrophysiological properties 
of the cells being studied, between 5-30 1] of tested drug stock solution was added 
to 1ml of medium to achieve the necessary concentration. The tested drugs 
included isoproterenol hydrochloride, nifedipine, ranolazine dihydrochloride, 
pinacidil monohydrate (all from Sigma), cisapride (Janssen-Cilag) and E-4031 
(Alomone Labs). Preliminary studies (data not shown) demonstrated that 
DMSO (solvent for nifedipine and pinacidil) alone did not have any effect on 
the measured electrophysiological parameters. Extracellular recordings were per- 
formed for 60s at baseline and after 5 min following drug application. 
Statistical analysis. Results are reported as mean + s.e.m. Comparison between 
LQTS human iPSC-CMs and control healthy human iPSC-CMs groups was per- 
formed using the unpaired Student’s t-test. Drug effects on the cells were analysed 
by paired t-test. One-way ANOVA followed by Tukey’s post-hoc tests were used 
when comparing multiple groups. P < 0.05 was considered statistically significant. 
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